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FOREWORD 


This book grew out of a course of lectures on functional analysis taught over many 
years to second-year graduate students at the Courant Institute of New York Uni¬ 
versity. It is a graduate text, not a treatise or a monograph. Most of the chapters are 
short, for it is easier to digest material in small chunks. Not all topics can be pre¬ 
sented briefly, so some of the chapters are longer. Theorems and lemmas, as well as 
equations, are numbered consecutively in each chapter. 

The first 23 chapters make only a modest technical demand on the reader; this 

material would serve very well as text for an introductory graduate course on func¬ 

tional analysis. The rest of the material is well suited as text for a more advanced 
graduate course on functional analysis in general, or on Hilbert space in particular. 

When I was a student，the only text on functional analysis was Banach’s original 
classic, written in 1932; Hille’s book appeared in time to serve as my graduation 
present. For Hilbert space there was Stone’s Colloquium publication, also from 1932, 
and Sz.-Nagy’s Ergebnisse volume. Since then, our cup hath run over; first came 
Riesz and Sz.-Nagy, then Dunford and Schwartz, Yosida, later Reed and Simon, and 
Rudin. For Hilbert space, there was Halmos’s elegant slender volume, and Achiezer 
and Glazman, all of which I read with pleasure and profit. Many, many more good 
texts have appeared since. Yet I believe that my book offers something new: the 
order in which the material is arranged, the interspersing of chapters on theory with 
chapters on applications, so that cold abstractions are made flesh and blood, and 
the inclusion of a very rich fare of rnatfiematical pr661e"ms" that can be clarified and 
solved from the functional analytic point of view. 

In choosing topics I heeded the warning of my teacher Friedrichs: “It is easy to 
write a book if you are willing to put into it everything you know about the sub- 

ject.” I present the basic structure of the subject, and those more advanced topics that 
loom large in Spectral resolution and 

spectral representation of self-adjoint operators, the theory of compact operators, the 
Krein-Milman theorem, Gelfand’s theory of commutative Banach algebras, invari¬ 
ant subspaces, strongly continuous one-parameter semigroups. I discuss the index 
of operators, so important in calculating topological invariants; the celebrated trace 
fomlula of Lidskii, a powerful tool in analysisriSe"FredhoIrh determmant and its 
generalizations, rising again after almost a hundred years of hibernation; and scatter- 
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FOREWORD 


ing theory, another gift from physics to mathematics. I have also included some (but 
not all) special topics close to my heart. 

"What has been omitted? All of nonlinear functional analysis, for which I recom¬ 
mend the four-volume treatise by Zeidler. Operator algebras, except for Gelfand，s 
theory of commutative Banach algebras. I slight the geometric theory of Banach 
spaces; happily a.handbook-on-this-subject, edited-by Bill Johnson and Joram Lin- 
denstrgu gsjs t g) bg by Holland. 

Wfiat are the prerequisites? What every second-year graduate student — and many 
undergraduates —— knows: ^ : 一 — 

® Naive set theory. Denumerable sets, the continuum, Zorn’s lemma. 

•: Linear algebra. The alternative for linear maps, trace and determinant of a 
matrix, the spectral theory of general* and symmetrib matrices, functions of a 
matrix. 

• Point set topology. Complete metric spaces, the Baire category principle, Haus- 
dorff spaces, compact sets, Tychonov，s theorem. 

• Basic theory of functions of a complex variable. 

• Real variables. The Arzela-Ascoli theorem, the Lebesgue decomposition of 
measures on M, Borel measure on compact spaces. 

It is an accident of history that measure theory was invented before functional 
analysis. The usual presentations of measure theory fail to take advantage of the con¬ 
cepts and constructions of functional analysis. In an appendix on the Riesz-Kakutani 
representation theorem I show how to use the tools of functional analysis in measure 
theory. Another appendix summarizes the basic facts of Laurent Schwartz 9 s theory 
of distributions. 

Many of the applications are to problems of partial differential equations. Here a 
nodding aquaintance with the Laplace and the wave equation would help, although 
an alert uninformed reader could pick up so^ 

Like most mathematicians, I am no historian. Yet I have included historical re- 
marks in some of the chapters, mainly where I had some firsthand knowledge, or 
where conventional history has been blatantly silent concerning the tragic fate of 
many of the founding fathers of functional analysis during the European horrors of 
the 1930s and 1940s. 

Iam indebted to many. I learned the rudiments of functional analysis, and how 
to apply them, from my teacher Friedrichs. Subsequently my views were shaped by 
the work of Tosio Kato, who has brought the power of functional analysis to bear 
on an astonishing range of problems. My happy and long collaboration with Ralph 
Phillips has led to some unusual uses of functional analysis. I learned much from 
Israel Gohberg, especially about the index of Toeplitz operators, from Bill Johnson 
about the fine points of the geometry of Banach spaces, and from Bob Phelps about 
Choquet’s theorem. I thank Reuben Hersh and Louise Raphael for their critique of 
the appendix on distributions, and Jerry Goldstein for his expert comments on the 










material on semigroups and scattering theory. To all of them, as well as to Gabor 
Francsics, my thanks. 

Jerry Berkowitz and I alternated teaching functional analysis at the Courant In¬ 
stitute. This would be a better book had he lived and looked the manuscript over 
critically. 

I thank Jeff Rosenbluth and Paul Chemoff for a careful reading of the early chap¬ 
ters, and Keisha Grady for Taxing the manuscript, and cheerfully making subsequent 
changes and corrections. 

The lecture course was popular and successful with graduate students of the 
Courant Institute. I hope this printed version retains the spirit of the lectures. 


Peter D. Lax 






LINEAR SPACES 


A linear space X over a field F is a mathematical object in which two operations 
are defined: addition and multiplication by scalars. 

Addition, denoted by +, as in 




is assumed to be commutative. 


associative ， 


x-\-y = j 

x-h(y + z) == {x + y) + ^ 


and to forma group, with the neutral element denoted as 0: 

jc + 0 = x. 


⑴ 


( 2 ) 


(3) 


⑷ 


The inverse of addition is denoted by -： 

x + (-x) = x - x = 0, (5) 


The second operation is the multiplication of elements of X by elements k of the 
field F: 


kx. 


MilRpdby elements 

of F is assumed to be associative, 

k(ax) = (ka)x, (6) 

Qnd'disfribiifive y 


k(x + y) = kx + ky 


⑺ 









as well as 
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(a + b)x = ax + bx. (8) 

We assume that multiplication by the unit of F, denoted as 1, acts as the identity: 


These are the axioms of linear algebra. From them proceed to draw, some deduc- 
tions. 

Setb = 0 in (8). It follows that for all x. 


Ox = Q. ’• (10) 

Seta = = -1 in (8). Using ⑼ and (10), we deduce that for all x, 

(一 l)d (11) 

The finite-dimensional linear spaces are dealt with in courses on linear algebra. In 
this book the emphasis is on the infinite-dimensional ones—those that are not finite- 
dimensional. The field F will be either the real numbers M or the complex numbers 
C. Here are some examples. 

Example L X is the space of all polynomials in a single variable with real coeffi¬ 
cients, here F = M. 

Example 2. X is the space of all polynomials in N variables 勾， … ， w，with real 
coefficients, here F = M. 

Example 3. G is a domain in the complex plane, and X the space of all functions 
complex analytic in G，here F = C. 

Example 4. X = space of all vectors 

x = 01 ，处，…） 

with infinitely many real components, here F = R. 

Example 5. 2 is a Hausdorff space, X the space of all continuous real-valued func¬ 
tions on 2, here F = M. 

Example 6. M is a C°° differentiable manifold, X = C°°(M), the space of all 
differentiable functions on M. 

Example 7. 2 is a measure space with measure m，X = L 1 (Q, m). 
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Example 8. X = L P (Q, m). 

Example 9. X = harmonic functions in the upper half-plane. 

Example 10. X = all solutions of a linear partial differential equation in a given 
domain. 

Example 11, All meromorphic functions on a given Riemann surface; F = C. 

We start the development of the theory by giving the basic constructions and con¬ 
cepts. Given two subsets S and r of a linear space X, we define their sum, denoted 
as 5 + T to be the set of all points x of the form x = 3 ； + z, 3 ; in 5, z in T. The 
negative of a set 5, denoted as 一 5, consists of all points x of the form a: = y 
in 5. 

Given two linear spaces Z and JJ over the same field, their direct sum is a linear 
space denoted as Z ㊉ i/，consisting of ordered pairs {z, w)» z in Z, u in U. Addition 
and multiplication by scalars is componentwise. 

Definition. A subset F of a linear space X is called a linear subspace of X if sums 
and scalar multiples of Y belong to Y. 

Theorem 1. 

(i) The sets {0} and X are linear subspaces ofX. 

(ii) The sum of any collection of subspaces is a subspace. 

(iii) The intersection of any collection of subspaces is a subspace. 

(iv) The union of a collection of subspaces totally ordered by inclusion is a sub- 
space. 

Exercise 1. Prove theorem 1. 
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Proof. Part (i) is merely a rephraisiiiig of th6 definition Of linear span. To prove 
part (ii)，we remark that on the one hand, the elements of the forn3L(12)«fmxtaJinfeaL_ 
subspace of X; on the other hand, every x of form (12) is contained in any subspace 
Y containing 5. 

Remark 1. An element x of form (12) is called a linear combination of the points 
x\ y ...,x n .So theorem 1 can be restated as follows:. _ 一 __ 一 一一 

linear span of a subset S of a linear space consists of all linear combinations 
of elements ofS. 

Definition. X a linear space, Y a linear subspace of X, Two_points jr.j_.and A* 2 - 0 f.X _ 
are called equivalent modulo 7 , denoted sls>x\ = X 2 (mod Y\ if x\ - X 2 belongs 
toy. — 。 


each equivalence 



As with all algebraic structures，so with linear structures we have the concept of 
isomorphism. 

Definition. Two linear spaces X and Z over the same field are isomorphic if there 
is a one-to-one corresporidence T carrying one into the other that maps sums into 














Definition. X is a linear space over the reals; a subset iiT of X is called convex if, 
whenever 工 and y belong to K, the whole segment with endpoints x,y, meaning all 
points of the form 

ax + (l -«)y, 0 <a < 1, (15) 

also belong to K. 

Examples of convex sets in the plane are the circular disk, triangle, and semicir¬ 
cular disk. The following property of convex sets is an immediate consequence of 
the definition: 、 


Theorem 4. Let K be a convex subset of a linear space X over the reals. Suppose 
that x\ ， … ， x n belong to K; then so does every x of the form 


(16) 

E a ) = L 

I 

Exercise 次一 Pfove theorem 4 ； … " 



An 又 of form (16) is called a convex combination of ^i, X 2 ,... T x n . 








Theorem 5. Let X be a linear space over the reals. 


(i) The empty set is convex. 

(ii) A subset consisting of a single point is convex. 

(Hi) Every linear subspace of X is convex. 

(iv) Thesum of Wo convex_subsetsJs convex, _ _ _ 

(v) If K is convex, so is —K. _^_ 

(vi) The intersection of an arbitrary collection of convex sets is convex. 

(vii) Let [Kj] be a collection of convex subsets that is totally ordered by inclusion. 
Then their union UKj is convex. 

(ix) The inverse image of a convex set under a linear map is convex. 

Exercise 5. Prove theorem 5. 


Definition. Let «S be any subset of a linear space X over the reals. The convex hull 

of 5 is defined as the intersection of all convex sets containing 5. The hull is denoted 

as S. 

Theorem 6. 


(i) The convex hull ofSis the smallest convex set containing S. 

(ii) The convex hull of S consists of all convex combinations (16) of points of S. 

Exercise 6. Prove theorem 6. 

Definition. A subset £ of a convex set K Is called an extreme subset of K if: 

"(i)--』is convex and nonempty. 

(ii) whenever a point x of E is expressed as 

y + z • 

X = --y-, }\ z m K, 

then both y and z belong to E. 

An extreme subset consisting of a single point is called an extreme point of K. 
Example 1. K is the interval 0 < x < 1; the two endpoints are extreme points. 
Example 2. K is the closed disk 
• Jc 2 + j 2 < 1. 

Every point on the circle x 2 + > ,2 = 1 is an extreme point. 








Example 3. The open disk 


x 2 + y 2 < l 


has no extreme points. 

Example 4. K a. polyhedron, including faces. Its extreme subsets are its faces, edges, 
vertices, and of course K itself. 


Theorem 7. Let K be a convex set, E an extreme subset of K ， and F an extreme 
subset of E, Then F is an extreme subset of K. 

Exercise 7. Prove theorem 7. 


Theorem 8 . Let M be a linear map of the linear space X into the linear space U. 

Let K be a convex subset ofU, E an extreme subset of K. Then the inverse image of 

E is either empty or an extreme subset of the inverse image of K, 

Exercise 8. Prove theorem 8. 


Exercise 9. Give an example to show that the image of an extreme subset under a 
linear map need not be an extreme subset of the image. 

Taking U to be one dimensional, we get 

Corollary 8 ; . Denote by H a convex subset of a linear space X, l a linear map 
of X into R t H m \ n and H max the subsets of H, where i achieves its minimum and 
maximum, respectively. 


Assertion. When nonempty, H m i n and H m3tX are extreme subsets of H. 
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LINEAR MA PS— 


2.1 ALGEBRA OF LINEAR MAPS 

We recall from chapter 1 that a linear map from one linear space X into another, U 9 
both over the same field of scalars, is a mapping of X into t /， 

M:X—^U, 

that is an algebraic homomorphism: 

MU + )0 = M(x) + M(j), 

M(kxY=kM(x). (1) 

In this section we explore those properties of linear maps that depend on the purely 
algebraic properties (1)，without any topological restrictions imposed on the spaces 
X, £/. 

The sum of two linear maps M and N of Z into U, and the scalar multiple is 
defined as 

(M + N)(x) = M(jc) + N(jc), (2) 

(kM)(x) = kM(x), (3) 

This makes a linear space out of the set of linear maps of X into U. The space 
is denoted as C(X, U). Given two linear maps, one, M from X U, the other, N 
from £/ -> W, we can define their product as the composite map 

(NM)(jc) = N(M(jc)). (4) 

Since compositon of maps in general is associative, so is in particular the composi¬ 

tion of linear maps. As we will see, composition is far from being commutative. 
From now on we omit the bracket and denote the action of a linear map on x as 

M(x) = Mjc. 

This notation suggests that the action of M on x is a kind of multiplication; indeed 


property 














The range of M, denoted by Rm, 
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We formulate now a number of useful notions and results concerning maj^ingfi 
M:X—^X. 

We denote bv Ni the nullsvace of the /th nower of M ： 


N J C N J+ i for all j ⑹ 



Proof. Equation (6) is an immediate consequence of (5). To show (7)，we claim 
that M maps Nj+\/Nj into Nj/Nj^\ in a one-to-one fashion. To see this, note that 
a nonzero element of Nj + \/Nj is represented by a point z in Nj+i that does not 
lie in Nj. Clearly, Mz lies in Nj but not in Nj -\； this shows the one-to-oneness. 
It follows that Nj + \/Nj is isomorphic to a subspace of Nj/Nj^u from which the 
statement (7) about dimension follows. When Nj + \/Nj is infinite-dimensional, so 
is Nj/Nj^ h — ，口 

The following is an immediate corollary of equation (7): 

Theorem 2\ Suppose that for some i the subspaces defined by (5) satisfy 

Ni = N w ; (8) 

then 

Nj = Nk for all k > i. (8 ，） 

Definition, A subspace F of X is called an invariant subspace of a linear map M: 
X XifM maps Y into Y. 

Theorem 3. Suppose that Y is an invariant subspace of X for a mapping M: A 7 —> X. 
Then 

(i) there is a natural interpretation of M as a mapping X/Y X/Y. 

(ii) if both maps 

M: y —» YandM: X/Y —» X/Y 


are invertible, so isM: X X. 







we remarK inai wnereas mveniDmiy oi ivi on i ant 

ibility of M on X, the converse by no means holds in i 

For example, let X be the space of all bounded continuo 
_operator. . .. 一 


(Sx)(t) = x(t 













Theorem 4. Let M be a linear map: X H 


(i) For any y in X， the set {/?(M))，}, where p represents any polynomial, is an 
invariant subspace ofM. 

(ii) Let T be a linear map: X X that commutes with M : TM = MT. Then the 
nullspace of T is an invariant sulTspaceofM, 


_ : _ Proof. Part (i) rests on the observation that if p(M) is a polynomial, so is M p (M)_ 

Part (ii) follows from the observation that if M and T commute, and if z is in the 
• nullspace'ofT : Tz = 0, tKen TMz = MTz = MO = 0.. □ 

— —~ 22WDEX OFALINEAR MAP ， 

The next group of theorems describe an important special class of mappings. 
Definition. A linear map G is called degenerate if its range is finite dimensional: 

dim Rg < oo. (10) 

Theorem 5. The degenerate maps form an ideal in the following sense: 

(0 The sum of Wo degenerate maps is degenerate. 

⑹ The product of a degenerate map with any linear map, in either order, is de¬ 

generate; that is, if G is degenerate, so are MG and GN，provided of course 
that the products can be defined. 

Exercise 4. Prove theorem 5. 

Definition. The linear maps M : X —» L/ and h : U X are pseudoinverse to each 
other if 

LM = I + G, ML = I + G ， (11) 

where I denotes the identity, G degenerate maps of X Z, and U U, respec¬ 
tively. 


Exercise 5. Prove that the right shift and the left shift described after theorem 1 are 
pseudoinverses of each other on the space of all sequences. 

Theorem 6. 

(i) IfL and M are pseudoinverses of each other, so are L -h Gj and M + G 2 , 
where Gj t G 2 are arbitrary degenerate maps. 

⑼ Suppose that M : X U and A: U W have pseudoinverses L and B, 
respectively. Then AM and LB are pseudoinverse to each other. 







Exercise 6. Prove theorem 6. 


We recall the definition of codimension of a subspace of a linear space U: 


codim R = dim(U/R). 


Theorem 7. A linear map M:X U has a pseudoinverse if and only if 
dim Nm < oo, codim Rm < oo. 

Proof. For the “only if，part we use a lemma: 

Lemma 8. If G is a degenerate map ofX -> X t then 

dim N\^.q < oo, codim Ri^q < 

Proof. For;c in M+g ， 

x + G^: = 0. 

This shows that 

^I+G C Rq \ 

combined with (10) this shows the first part of (13). 

According to theorem 1 (iii), G maps X/Nq one-to-one onto Rq\ so 

codim Nq = dim Rq. ( 14 ) 

Obviously I + G maps every x in Nq into itself; this shows that i?i +G d Nq. It 
follows from this relation that 

__ c odim Ri + 9 q < co dim Nq . — (14') 

Combining (14) and (14’)，we conclude that codim /? I+G < dim R G ; using (10), we 
deduce the second part of (13). □ 


__Suppose now that M has a pseudoinverse; then (11) holds. From the first relation 

If Cl 0 wTdeduS^tKan^'c ^ 

this with the first part of (13), we obtain the first part of (12), It follows from the 
second relation in (11) that 〕及 i+g. Therefore 

_codim Rm < codim _ 


Combining this with the second relation in (13), we deduce the second part of (12). 


( 12 ) 


(13) 























Theorem 10. ! 







1 subspace of A^lm. Q is the natural map of points 
oiU mod Ru containing them. E is the mapping 
?LM into equivalence classes mod Rl, 


Exercise 9. Verify that (22) is an exact sequence. 

We apply relation (20') to the exact sequence (22), with 

Vb = 0, V] = Nu, Vl = Nim, ^3 = 

V 4 = U/Ru, Vs = V6 = W/R hi V 1 = 0. 

Using the definition of codimension, we can write (20’) as follows: 

dim Nu - dim Nlm + ciimA^L - codim /?m + codim ~ c ? dim = 0. 

Using the definition (18) of the index, we deduce the product formula (19) for the 
index. ^ 

The next result is called the stability of index: 

Theorem 12. LetM : X U be a linear map with a pseudoinverse, and G : X 
JJ a degenerate linear map. Then M + G has a pseudoinverse, and 

ind(M + G) = indM. (23) 

Proof. We first verify (23) for [/ = X and M = I. For this we need a lemma: 


tee, andK : X — U a linear map ofX into U that 
linear subspace ofX that has finite codimension. 
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Then Kq : Xq U, the restriction ofKto Xq, has a pseudoinvef.se，and 

indKo = indK - codim Xq. (24) 

Proof. Factor K。as 

Ko = KIo, (24 ; ) 

where Io : Xo X is the identification map. Clearly Ni 0 = {0}, R\ 0 = Xq, so 

indlo = -codim Xq. (25) 

Now we apply the product formula (19) to (24’）and deduce (24). 口 

Let G : X X be a degenerate map; take K : X -> Z to be 

K = I + G. (26) 

Clearly, I is a pseudoinverse to K. Take Xq to be the nullspace of G: 

Xo = N g , (27) 


By (14), Xq has finite codimension. Since G is zero on Xq, Ko, the restriction of K 
to Xq, is the identification map Iq. So by (25), 

indKo = indlo = —codim Xq. 

We apply now lemma 13 to K. By (24), 

indK q = indK — codim Xq. 

We deduce from the last two relations that 

ind K = 0 (28) 

for every K of form (26). This proves (23) for M = I. 

We take now M as any map with a pseudoinverse; denote by L : t/ X a 
pseudoinverse of M.. By definition, 

_ ~ 

- G’degenerate. So by (28)，- 

ind (LM) = ind (I + G，) = 0. (29) 

Using the product formula (19)?-we get- from-(-29-)-t-hat - 


indL = -ind M. 


(30) 


: c. 





As we saw in theorem 6 (i), for degenerate G，L is also a pseudoiriverse of M + G. 
Therefore, using (30)，once more we deduce that 

indL = _ind(M + G). (30') 

_Combining..(30) and (3.00, we get (23). 

Not ps_ 

The first part of this chapter is standard fare* The nonstandard, items areas follows: 

(i) The notion of the index of linear maps that have a pseudoinverse, theorem 7. 

(ii) The product formula for the index, theorem 10. 

(iii) The invariance of the index under perturbation by degenerate maps, theo¬ 
rem 12. 

Strange to say, these results of linear algebra were first discovered in the setting 
of bounded maps of normed linear spaces. That they hold without any topological 
assumptions has remained a folk theorem. The first statement and proof of the multi¬ 
plicative property in print is due to Donald Sarason. The proof presented here, using 
exact sequences, is due to Sergiu Klainerman. 
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THE HAHN-BANACH 
THEOREM 


3.1 THE EXTENSION THEOREM 

The result named in the title of this chapter is remarkable for its simplicity and for 
its far-reaching consequences. It deals with the extension of linear functionals. 

Definition. A linear functional € is a mapping of a linear space X over a field F into 
F, that is linear: 


for all x, yinX and 


e(x + y)^l(x) + e(y) 

t(kx) = ki{x) 


for all k in F. . 

In this section we will mainly deal with linear spaces over the field of .reals, and 
real number valued linear functionals.. 

Theorem 1 (Hahn-Banach Theorem). Let X be a linear space over the reals, and 
pa real-valued function defined on X, which has the following properties: 

— (fy-Posithre^homogeneity, -- 

p{ax) = ap{x) for all a > 0 (1) 

for every x in X. 

(ii) Subadditivity, 


Pix + y)< p(x) + p(y) 


( 2 ) 


Y denotes a linear subspace of X on which a linear functional t is defined that is 
dominated by p: _ .. _ 

^00 < p{y) for all y in Y. (3) 

Assertion, i can be extended to all of X as a linear functional dominated by p: 

t(x) < p(x) for all x in X. •一 二一一，一 13 / 广， 

Proof. Suppose that Y is not all of X; then there is some ^ in. X that is not in Y. 
Denote by Z the linear span of Y and z，meaning all points of the form 

+ az, … )，-in Y; g in.M. ……_ _ :- 

Our aim i$ to extend £ as a linear functional to Z so that (30 is satisfied for x in Z. 
that is, 

£(y + az) = t{y)-¥ ai{z) < p(y + az) 

holds for all j in 7 and all real a. By (3), the inequality holds for a = 0. Since p is 


positive homogeneous, it suffices to verify it for a = 士 1 : 

i(y) + m< p(y + zh . 

Thus for all y,y f in 7, 

€(/) - p(y' ~z)< < p(y + z) - £(y) ⑷ 

must hold. Such an £(z) exists iff for all pairs y, y\ 

^iy') - piy' - z) < piy + z) - ^00- (5) 

This is the same as 

i(y') + i(y) = i{y' + y) < p(y + z) + p(y' - z). (S') 

Since }> + lies( in 7, (3) holds: 

^■{y' + y) < piy + y'). (6) 

By subadditivity, 

p{y 4- y f ) = p{y + Z + / ~ 2) < p(y + z) + 〆〆 一 Z). (7) 


Combining ( 6 ) and (7) gives (5’)，proving the possibility of extending l from Y to Z. 
So (30 remains satisfied. 

Consider all extensions of i to linear spaces Z containing Y on which inequality 
(30 continues to hold. We order these extensions by defining 

to mean that Z ! contains Z, and that f agrees with i on Z. 
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Let {Z y , £ v } be a totally ordered collection of extensions oft. Then we can define 
i on the union Z = UZ V as being on Z v . Clearly, i on Z satisfies (3 ’)； equally 
clearly, (Z Vl t v ) < (Z,i) for all v. This shows that every totally ordered collection 
of extensions of i has an upper bound. So the hypothesis of Zorn’s lemma is sat¬ 
isfied, and we conclude that there exists a maximal extension. But according to the 
foregoing, a maximal extension must be to the whole space X. □ 


3.2 GEOMETRIC HAHN-BANACH THEOREM 

In spite (or perhaps because) of its nonconstructive proof, the HB theorem has plenty 
of very concrete applications. One of the most important is to separation theorems 
concerning convex sets; these are sometimes called geometric Hahn-Banach theo¬ 
rems. 


Definition. X is a linear space over the reals, S a subset of X. A point xq is called 
an interior point of S if for any j in X there is an depending on y, such that 

xq + ty s S for all reaU, \t\ < €. 

Let K be a. convex set that has an interior point, which we take to be the origin. 
We denote the gauge pk of K with respect to the origin as follows: 

PkM = inf a a > 0,- e K. ( 8 ) 

a 

Since the origin is assumed to be an interior point of K, 
p K (x) < oo 

for every x. 

Theorem 2. The gauge pk of a convex set K in a linear space over the reals is 
positive homogeneous and subadditive. 

Proof. Positive homogeneity follows from the definition ( 8 )，even when K is not 
convex. To prove subadditivity, let x and y be any pair of points in X, a andfr positive.. 
_aumMts such thaL.... •_ _ ____ : _ 

-&K, ^-eK. (9) 

Cl 0 

Convexity, as defined in chapter l, means that any convex combination of points of K 
belongs toi^T. We take the convex combination ofx/a and y/b with weights a/( a+fe) 
and b/(a + 6 ). These are nonnegative numbers whose sum is 1. We conclude that 

a x , b y x+y 

--- 1 - - ^ - 7 ^ 尺 . 

a + b a a + 1? b a + b 
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Since + + is in X3y definitibn + &. Since this holds 

for all a and b satisfying (9 )， . ... 

Pk(x + y) < inf (a + b) = infd + infb = pkM + p^ty), 

where in the last step we have again used (8). This proves subadditivity of pk ， □ 

Theorem 3. For any convex set K, 

Pk(x) <1 ifxeK, (10) 

PK (X) < 1 iff x is an interior point of K • (10 ’） 

Proof. (10) is an immediate consequence of definition (8) of pj(. □ 

Exercise 1. Prove (100. 

The converse of theorem 3 also is true: 

Theorem 4. Let p denote a positive homogeneous, subadditive function defined on 
a linear space X over the reals. 
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i{x) > c, or i{x) < c, 

are called closed halfspaces. 

Theorem 5 (Hyperplane Separation Theorem). Let K be a nonempty convex sub¬ 
set of a linear space X over the reals; suppose that all points of K are interior. Any 
point y not in K can be separated from K by a hyperplane i(x) = c; that is, there is 
a linear functional depending on y, such that 

l{x) < c for all x in K\ i(y) = c. (11) 

Proof. Assume that 0 e K, and denote by pk the gauge of K. Since all points of 

K are interior, it follows from theorem 3 that /? 尺 U) < 1 for every x in K. We set 

t{y) = 1. (12) 

Then i is defined for all z of the form ay, 

t{ay) = a. (120 

We claim that for all such z, 

^)< Pk(z). 

This is obvious for a <0, for then C(z) < 0 while pk > 0. Since y is not in K, by 
( 8 ), pK(y) 5: 1. So, by positive homogeneity, PK^y) > a for a > 0. 

Having shown that i, as defined on the above one-dimensional subspace, is dom¬ 
inated by pk, we conclude from the HB theorem that i can be so extended to all of 
X. We deduce from this and (100 that for any x in K, 

< PkM <1 

This gives the first part of (11), with c = 1; the second part is (12). □ 

Corollary 5 f . Let K denote a convex set with at least one interior point. For any y 
not in K there is a nonzero linear functional t that satisfies 

_ Ux) < Uy) for all x in K. _(13). 

Theorem 6 (Extended HypeBlS 為⑤ 瓦 

and M disjoint convex subsets of X;atieast one of which has an interior point. Then 
H and M can be separated by a hyperplane t{x) = c; that is, there is a nonzero 
linear functional i t and a number c f such that 

t(u) <c <HvV~ ~ — TT47 

for all u in H, all v in M, 


% 










Proof. According to theorem 5 of chapter 1, the difference set H 一 M = AT is 
convex; since either H ox M contains an interior point, so does K. 

Since H and M are disjoint, 0 ^ K\ according to (13) of corollary 5’ applied to 
y = 0, there is a linear functional i such that 

<£(0) = 0 for all x in K. (15) 

Since all x in 尺 =H — M is of the form jc = w — u, w in u in M, (15) means that 

Uu) < l{v)\ 

(14) follows from this, with c = sup uefi t{u). □ 


3.3 EXTENSIONS OF THE HAHN-BANACH THEOREM 


The following extension of the H-B theorem, due to R. P. Agnew and A. P. Morse, is 
both useful and beautiful: 


Theorem 7. Let X denote a linear space over the reals and A be a collection of 
linear maps A v : X X that commute; that is ， 

AyAjLt = A^/,A.y ( 16 ) 

for all pairs in the collection. Let p denote a real-valued, positive homogeneous, 
subadditive function on X — see (1) and (2)—that is invariant under each A v : 

p(A v x) = p(x). (17) 

Let Y denote a linear subspace of X on which a linear functional l is defined, with 
the following properties: 


(i) i is dominated by p, namely 

^(y)< P(y) (18) 

for evejy y in Y. ’ 

(ii) Y is invariant under each mapping A, namely 

for y in Y f Ay in Y. (19) 

(iii) l is invariant under each mapping A, namely 

£(A：y) = i(y) for y in Y. (19’) 


Assertion, t can be extended to all ofX so that £ is dominated by p in the sense of 
(18)，and is invariant under each mapping A u . 





Proof. If (17) holds for two mappings A and B of the collection A, it also holds 
for their product AB，defined as their composite. Similarly, if (19) and (19’）hold for 
A and B, they hold for the product AB. Likewise, if A and B commute with all A v , 
so does their product. Thus we may adjoin to the collection A any finite products and 
the identity I. This enlarged collection will now form a semigroup. Then, if A and B 
belong to it, so does their product AB. From now on we assume that the collection 
乂 is a semigroup under multiplication. 

We define a new function g on X as follows: 

g(x) = inf/?(Cx), (20) 

with C a convex combination of mappings in A, namely maps of the form 

c = a j > o, = 1, Ay in A. 

Since 乂 is a semigroup, the product of two convex combinations of mappings in A 
is also a convex combination. 

Using subadditivity, homogeneity, and invariance (17), we deduce that 
p(Cx) = p < J2 a jP(Ajx) = p(x). 

Since in (20) we may take C to be the identity, it follows that 
gM < pM， 

Since p is positive homogeneous, it follows from (20) that so is g. We show next that - 

g is subadditive. 

Let.x and y be arbitrary elements of X. By definition (20), for any € > 0 there are 
maps C and D in the convex hull of A such that 

p(Cx) < g(x) + e, p(JDy) < g(y) -f e.. 

Applying (20) to the map CD, we get, since C and D commute, that 

g{x + y)< p(CD(x + y)) = p(DCx + CDy). 

Using subadditivity, and (21), the right side of (23) is seen to be less than 

- — - p(.DC^-4-/?4CD->9-^ p(Dy)- - (24)- 

Using (22) to estimate (24)，we conclude that 

g(x + y) < g(x) + 茗 (30 + 2e; 

since € is arbitrary, subadditivity of g follows. 

Since, by (190, £ on F is invariant under each A, for any convex combThatioiTC 
of mappings in A and for any y in Y, 


( 22 ) 

(23) 







£(Cy) = i (j2 a J A jy) = X^( A P’) = X^a/Cy) = i(y). 

It follows from (19) that if y belongs to Y, so does Cy. Applying (18) to Cy, we get 
that for y in Y, 

UCy) < p(Cy). 

Since we have shown that t(Cy) = €(v),__ 

^(>0 < P(C> 0 ； 

by definition (20) of it follows from this that for all y in Y, 

Uy) < g(yy (25) 

We apply now the Hahn-Banach theorem to conclude that £ can be extended to all 
of X so that (25) holds. We claim that € thus extended is invariant under all mappings 
A in ^4 in the sense of (19). For any Ain ^4 and any natural number n, we define 
by C n = i Ed— 1 A 人 Since 乂 is a semigroup, C n belongs to the convex hull of A. 
According to the basic formula for geometric series, C„(I 一 A) = |(I — A 71 ). 

Let x be any point in X; by definition (20) of g, 

g(x - Ax) < p{C n {x - Ax)) = p{C n (l- A)x) = -p{x - A n x). (26) 

n 

In the last step we used the formula for geometric series, and the positive homogene¬ 
ity of p. Using subadditivity and (17)，we deduce that 

- A n x) < -[p(x) + p(-A n x)] = 丄 [p(x) + p(-x)]. 
n . n n 

Combining this with (26), we get 

g{x - Ax) < - [p(x) + p(-x)]. (260 

n 

Now we let n oo; since the right side of (26’）tends to 0, 

g(x-Ax)<0, (27) 

/ 

Since g dominates t, we deduce from (27) that 

l(x - Ax) < 0. 

Since i is linear, this implies that for all x, 

£(x).<i(Ax). (270 


Replacing x by 一 ; c，we get 


(一 Ajc )， 
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which is the opposite of inequality (27’). So equality must hold，meaning that t is 
invariant under each A. 

By construction, t is dominated by 尽 . It follows then from (21') that it is dominated 
by p. □ 

Exercise 3. Show that theorem 7 remains true if condition (17) is replaced by 
P(Ajc) < p(x). 

We conclude by a version of HB for complex linear space due to Bohnenblust and 
Sobczyk, and Soukhomlinoff: 

Theorem 8. Let X be a linear space over C, ancl p a real valued function that 
satisfies 

(i) 

P(cix) = \a\p(x) (28) 

for all complex a，all x in X; 

(ii) subadditivity, 

p(x + y) < p(x)-\- p(y). 

Let Y be a linear subspace ofX over C, and let i be a linear functional on Y that 
satisfies 

\Uy)\ < p(y) fory in Y. (29) 

Assertion, t can be extended to all ofXso that (29) holds over X. 

Proof. Split t into its real and imaginary part: 

= + _ _ —^(30) 

Clearly, i\ and i 2 are linear over R, and are related by 

. . . . Uiiy) = -h(y)^ _(3j0_ 

- 弨 lyrif A: 这才 li 抵 wf^nctional overR ， : 二二 ―― 广 ― —- : ------. 

t(x)=l\(x)-iZ\(ix) ' (310 

is linear over C. 

We turn- now to-the tasbof-extending4 Jt-follows-from-429)>and-(-3())-tlmt - —- 

^\{y)< p(y )： (32) 





Therefore by the real H-B theorem, t\ can be extended to all of X so that (32) holds. 
We define £ on Z by (31). Clearly, C is linear over C and we claim that (29) holds. 
To see this，write 


t{x) = ar, r real, \a\ = 1. 

Then . 

…— —|^|-=r = cT 1 取 你 ，; T); 石 ~ — 

This completes the proof of the complex H-B theorem. □ 

A historical review and a modem update is given by Gerard Buskes in his survey 
article. 
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4 

APPLICATIONS OF THE 
HAHN-BANACH THEOREM 


4.1 EXTENSION OF POSITIVE LINEAR FUNCTIONALS 

S denotes any abstract set, and B = B(S) the collection of all real-valued functions 
x on S that are bounded, that is, satisfy 

\x(s)\ < c. (1) 

丑 is a linear space over the reals. 

There is a natural partial order for the elements of B : jc < 3 ^ means that x{s) < 
y(s) for all s in S. A function x satisfying 0 < jc is called nonnegative. 

Let F be a linear subspace of B that contains some nonnegative functions. A 
linear functional l defined on Y is called positive on Y > 0 for all nonnegative 
y in Y. Every positive linear functional l is monotone: 

... y\ < yi implies < i(y 2 ). ( 2 ) 

Theorem 1. Let Y be a linear subspace of B that contains a function ^0 greater 
than some positive constant，say 1: 

1 < : yo ⑻ for all s in S, (3) 

Lett be a positive linear functional defined on Y • 


Assertion, i canbe extended to all ofB as a positive linear functional 
Proof: We define the function /? on B as follows: for any x in B ， 

PM = mf£(y) f x<y\y in Y. ⑷ 

This*function-/? is well-definedffor-itfoHows-fronrCi^andtS^thatr - 

-cy 0 <x < cy 0 , (5) 

29 
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which shows that the inf in (4) is over a nonempty set, and that p (x) < ci(yo) where 

c is any constant satisfying (1). The smallest such constant is c = sup s - in -^-|x(^-)-l- - 

It follows from (5) that any y > x satisfies —cjo < x < y. Since i is linear and 
positive, for such y it follows from (2) that -ci(yo) < l{y), and so by (4) 

-cZ(y 0 ) < p{x). (6) 


Lemma 2. The function p defined by (4) is 

(i) positive homogeneous. 

(ii) subadditive. 

(即 negative: p(x) < Oforx < 0. .... . . . 

(iv) p(x) = i{x)forx in Y. " 

Proof- 

(i) It follows from the definition that x < ,y implies ax < ay, a > 0. Positive 
homogeneity follows from definition (4). 

(ii) Let x\ and 文 2 be any two functions in B, ji and any two functions in Y 
satisfying 


< yo. 


Adding the two we obtain x\ -f x% <y\ + yo \so by definition (4) of p. 


P(^\ +x 2 ) : 






P(xi) + p(x 2 ) 


⑺ 


This proves subadditivity. 

(iii) Suppose that x < 0; then y = 0 is admissible in the inf on the right in (4), 
giving p(x) < €(0) = 0, as asserted in (iii). 

(iv) Suppose that jr belongs to Y; then by (2), x <y implies £(x) < t{y), equality 
holding for y =^x. Setting this into (4) gives p(x) = t(x), as asserted in (iv). 


It follows from lemma 2 that we can apply the Hahn-Banach theorem to extend i 
from Y to all of B so that 名 remains dominated by p: 

< P(x). ( 8 ) 

Suppose that x is nonpositive. Then by (iii), p(x) < 0, so by (8), 

< 0 forx<0, (9) 

This shows that i is positive, as asserted in theorem 1. 


□ 





Theorem 1 is a special case of a very general theorem of Mark Krein; see p. 20 of 
Kelley and Namioka. 

4.2 BANACH LIMITS 

B denotes the space of bounded infinite sequences x of real numbers, 

t …) • (10) 

5 is a linear space over the reals when vector addition and multiplication by a scalar 
are defined componentwise. We define the function p on B as follows: 


neons function of x\ we leave it as an textitexercise to the reader to prove that p is 
subadditive. 

Define A as left translation, that is, 


It is an immediate consequence of definition (11) that p is translation invariant, 
namely that 


We define Y as the space of convergent sequences of real numbers. Clearly, Y 
a linear subspace of B. On Y, we define the linear functional i by 


Clearly,_€ is linear. Comparing definitions (11) and (14), we conclude that 


..Q e . ar Iy»_L!.? mapped into itself by translation; equall; 
under translation: 


l(Ay) — i(y) for y in 


We apply now theorem 7 in chapter 3 to conclude that i can be extended i 
bounded sequences x in B so that 
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(i) £ is linear 

(ii) i is invariant under translation 

(iii) i is dominated by p. 


Theorem 3. To each bounded sequence (10) we can assign a generalized limit (or 
Banach limit), denoted as 

___ 1 — ; _ 

so that 

(i) For convergent sequences the generalized limit agrees with the usual limit. 

(U) . 

LlM(a n +b n )= LM 為 + LIM 

« — >00 n-^OO 77_»00 

(iii) For any k 

LIM a n +k = LIM a n . 

n—^-oo n-^-oo 

(iv) 

liminfa n < LIM^ < lim supa w . 

«->oo n-^oo 

Proof. We set, in the notation of (10), 

3JDM = t[x). 

Part (i) follows from (14) ， (140 ； part (ii) expresses the linearity of l\ part (iii) is the 
translation invariance of t. Part (iv) expresses the domination of £ by p, as defined 
by (11), and applied to £(x) and £(—x): 

-pi-x) < £(x) < p(x). □ 

Exercise 1. Show that if in section 4.1 we take S = {positive integers), Y the space 
of convergent sequences, £ defined by (14), the function p given by (4) is the same 
as defined by (11). 

Exercise 2. Show that a Banach limit can be so chosen that for any bounded se¬ 
quence (ci,C 2 ,...) that is Cesaro summable; namely the arithmetic means of the 
partial sums converge to c, 

LIM c n = c. 

tl-^OO 


Exercise 3. Show that a generalized limit as r oo can be assigned to all bounded 
functions x(t) defined on / > 0 that has properties (i) to (iv) in theorem 3. 




4.3 FINITELY ADDITIVE INVARIANT SET FUNCTIONS 


The Lebesgue measure on the unit circle is invariant under rotation. This measure can 
be extended to a considerably larger cr-algebra than the Lebesgue measurable sets on 
the unit circle so that rotational invariance is retained. However it is well known, 
and easy to show, that if we accept the axiom of choice, then there is no rotationally 
invariant countably additive measure defined for all subsets of the circle. We show 
now 

Theorem 4. One can define a nonnegative finitely additive set function m{P), for 
all subsets P of the circle, that is invariant under rotation. 


Proof. We take S to be the unit circle, and B the set of all bounded real-valued 
functions on S. We take Y to be the space of bounded, Lebesgue measurable func¬ 
tions on 5, and take i{y) to be the Lebesgue integral of y: 

Uy) = J^ymde. (17) 

The space Y contains the function yo = 1, so condition (3) of theorem 1 of sec¬ 
tion 4.1 is fulfilled. Therefore the function p described there by equation (4) is well 
defined. 

We denote by {A^} the action on function of rotations p of the circle. As remarked 
above, l is invariant under rotation: 


(A p ym = v(0 + p), m p y) =e(y). (18) 

Since the relation x < y also is invariant under rotation, it follows that p as defined 
by (4) is rotation invariant: 


p(A p x) = p(x). (180 

Rotations of the circle commute, and so the linear maps {Ap} form a commuting 
group of maps. We apply now theorem 7 of chapter 3 to conclude that t can be 
extended to all of .5_aa_that .€_is_J. . . . ... ... 


(i) linear. 

(ii) invariant under rotation. 


Let P be any set of points of the circle 5; denote by cp its characteristic function: 


c P (9) 


0 


if 9 is in P 
otherwise. 


(19) 


Wedefine-the-setrfunction^bysettingr 


m(P) = i{c P ). 


(190 
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As shown in theorem 1， it follows from t{x) < p{x) that t is positive. Since cp is a 
—nonnegative function, it follows from definition (19’）of m that m is nonnegative: 

m{P) > 0. 

Let p be'aliy rotMion; denote"the~set\P rotated by p as P + p. It follows from the 
二二 d&finition<49) of-c • 尸 -that - 一 - - 


…. c P ^ p =A p cp. (20) 

Since i is rotation invariant, it follows from the definition (19’）of m that 

m(P + p) 

meaning that m is rotationally invariant. 

Let Pi arid Po be disjoint subsets. Then, by definition (19 )， 

cp x up 2 = C P\ +c/> 2 . 

Setting this into the definition (19) of m，and using the linearity of l, we deduce that 
m{P\ U ft) = m(P 2 ) - 

This proves that m is finitely additive. □ 

NOTE. Rotations of the circle commute with each other, and so the operators Ap 
commute; this was needed in invoking theorem 7 of chapter 3. Rotations of the three- 
dimensional sphere do not commute, and neither do the corresponding operators A p . 
Therefore the above proof cannot be used to extend theorem 4 to three dimensions. 
In fact Hausdorff has shown that the three-dimensional analogue of theorem 4 is 
false; there is no rotational invariant, finitely additive set function on the 2-sphere. 
The proof is based on a finite decomposition of the 2-sphere, sometimes called the 
Banach-Tarski paradox. 

In conclusion, we point out that the duality theory of Banach spaces constitues the 
richest applications of the Hahn-Banach theorem. These are described in chapters 8 
and 9. 

HISTORICAL NOTE. His name is etched into the foundations of modern analy¬ 
sis: Hausdorff space, Hausdorff maximality principle，and Hausdorff measures are 
household concepts. He was a German mathematician, bom in 1868; as a young man 
he published several volumes of poetry and aphorisms. He spent most of his pro¬ 
fessional life as professor in Bonn. Because he was Jewish, in 1942 he was ordered 
deported，part of the “Final Solution” to kill all the Jews in Europe. Knowing what 
awaited them, Hausdorff, his wife, and sister-in-law committed suicide. 
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NOMED'LINEAR SPACES 


5.1 NORMS 

Let X denote a linear space over M or C. A norm in Z is a real-valued function: 
Z M，denoted as \x\, with the following properties: 

(i) Positivity^ 

|x|>0 forjc^O; |0| = 0. (1) 

(ii) Subadditivity, _ 一 

\x + y\<\x\ + \y\. (2) 

(iii) Homogeneity• For all scalars a, 

M = |a| | 又 |. ⑶ 

With the aid of a norm we can introduce a metric in X, by defining the distance 
of two points to Be 

d(,x,y) = \x-yl (4) 

It is easy to verify that this has all properties of a metric. Conversely, it is easy to show 
that every metric in a linear space that is translation invariant and homogeneous: 

d ( x ~\rz,y + z) = d(x,y) t d{ax,ay) = \a\ d(x, y) (4 ; ) 

comes from a norm via (4). ^ 

With a metric (4) we can employ topological notions such as convergent series, 
°P e ^ sets, closed sets, and compact sets. Those notions turn out to be crucial. 
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Definition, Two different norms, \x\\ and \x\ 2 , defined on the same space X are 
called equivalent if there is a constant c such that 

c\x\x < \x\ 2 < c _l |-v|i (5) 


for all x in X. 

The significance of this notion is that equivalent norms induce the same topology. 
In chapter 1 we looked at various ways of building new linear spaces; the same 
constructions can be used to build new normed linear spaces. Specifically we ob¬ 
served the following: 


(i) A subspace F of a normed linear space X is again a normed linear space. 

(ii) Given two linear spaces Z and U, their Cartesian product, denoted as a direct 
sum Z ㊉ C/，consists of all ordered pairs (z, w), z € Z,u € U. When Z and U 
are normed, Z ㊉ t/ can be normed, such as by setting 

|(z，M)| = |z|+|M|, |(z,«)「= max{|z|,|M|}, or \{z,u)\" = (|z| 2 +|“| 2 ) 1/2 . 

( 6 ) 


Exercise L 

(a) Show that (6) are norms. 

(b) Show that they are equivalent norms in the sense of (5). 


Let Xbea normed linear space, Y a subspace. We saw in chapter 1 that we can define 
their quotient X/Y as a linear space. We raise now the question: is there a natural 
way to introduce a norm in the quotient space? The answer is yes, provided that Y is 
closed: 


Theorem 1. Let Y be a closed subspace of a normed linear space X. Let [xj } be an 
equivalence class of elements ofX mod Y. We define 

\{xj}\ = ⑺ 

Assertion. (7) has all properties of a norm in the quotient space X/Y. 

- ^ 0 /-PFopert-y-(^),-homQgeneity,-holds-triviall.y. To.-verify subadditivity,Jet. {xy.}_ _ 

-.and [zj] denote two equivalence classes. For any 6 > 0 we can, by definition (7), 
choose representatives so that 

\xj\ < |{x；}|+€, |zy| < \[zj}\ + €. (8) 

By definition of addition in X/Y, Xj + Zj belongs to {々} + {zj}\ therefore, by 
— "definition (7), 


|{x 7 *} + {zy}| < \Xj^Zjl 






which by subadditivity in X, and (8), is 

训 + M<iM + i_+2f 

Since this is true for all e > 0, subadditivity of the norm (7) follows.. 

Clearly, (7) is nonnegative. To show positivity, suppose that |{j ： y}| = 0. By defi¬ 
nition (7), there is a sequence of elements x n in [xj] such that 

= ⑼ 

By definition of equivalence, the equivalent elements x n differ from each other by 
elements that belong to Y. In particular, we can write 

x n = x\ -y n , n = 2,3,..., y n in Y. 

Setting this into (9)，we see that 


which by (4) means in the language of metric spaces that 


(9 ; ) 


In a metric space, the limit of a sequence of elements in a subset Y belongs to the 
closure of Y. Now, since Y is assumed to be closed, (9’) implies that x\ belongs to 
Y. But then the whole equivalence class {xj} consists of elements of y, which is the 
zero element in X/Y. □ 

Theorem 2. Let X be a nonned linear space, Y a subspace ofX. The closure ofY 
is a linear subspace ofX. 


Exercise 2. Prove theorem 2. 


For purposes of analysis，in the construction of objects with desirable properties 
through limiting processes, we need metric spaces that are complete in the sense that 
every Cauchy sequence has a limit. So it is with nonned linear spaces: 

Definition. A Banach space is a nonned linear space that is complete. 

We recall the process of completion of a metric space whereby any metric space 
S is embedded in a complete metric space denoted as S, consisting of equivalence 
classes of Cauchy sequences. 5 1 is a dense subset of S, i.e. the closure of S is S. 

Theorem 3. The completion X of a normed linear space X under the metric (4) has 
a natural linear structure that makes X a complete nonned linear space. 

















(e) 2 a topological space, X the space of all complex-valued, continuous func¬ 
tions / with compact support J!he4iorm-is_ 


|/|max = nm|/(^)|. 


(130 


This space is not complete unless Q is compact. 

(f) D some domain in K 77 , X the space 勺 f continuous functions-y^ithiompacr ^ 二 


supportT^fte-nonB-i 


l/lp = ( 乂 \f {x)\ p dx ^ ， 1 < p. 


(14) 


This space is not complete; its completion is denoted by L p . 

(g) Z) some domain in M 71 , the space of all C°° functions f in D with the following 
property: for some integer k and p >1, 

J \B a f\ p dx <oo for all \a\ <k, 

where 3 a is any partial derivative: 

3® = 3® 1 ... d^", dj = |a| =«] + 


The norm is 


i/U.p = 




I/P 


\d a f\Pdx 


(15) 


This space is not complete; its completion is denoted as W kiP , and is called a 





NORMS 

We take first p = oo. By (10)， 

|A ： + y|oo = sup|a ； < sup \aj\ + \bj\ 
j J 

< sup \aj I + sup|fcj| = \x\oc + l^loo* 


Next we turn to p = 1. By (11), 

+ ^ \^j + ~l S [㈣ + \bj\ = |j：h +.1^11. 

For 1 < p < oo we need Holder’s inequality. To state it, we introduce vectors u 
with finite g-norm: 


u = {ci,c 2> ...}- (J2\ c j\ q )' /q = \ u U < ⑽ 

where q is conjugate to p, in the sense 

- + i = l. (17) 

p q 


We define now a scalar product between vectors in l p and i q as follows: 


(JC, W) = Y^CljCj. 


(18) 


Holder's Inequality. For ;c in 名' « in 作 the series defining the scalar product (18) 
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|义 + ) ， | 及 =max + w)| < max |(x,w)| + |(>»,w)|. (22) 

l«l 9 =l 

By Holder's inequality (19), for \u\ q = 1, 

< \x\ p , \(y\u)\<\y\ p . 

Setting this into (22) gives 

一 一 ■ — — ... - -- -;- |艾寸了 l^r^ixl^riHTt^ -- 

as asserted in theorem 4. □ 


The self-conjugate case p = q = -2 is an instance of a supremely important class- 
of norms, to be discussed in the next chapter. 

The norms defined in examples (f) and (g) satisfy important inequalities due to 
Sobolev: If 


mp < n 


and 


p<q < 


np 

n — kp 


and if <2 is a cube, then 


(23) 


\f\q < const. \f\k,p, (23') 

where the constant depends only on p,q'k ， n. These inequalities hold of course for 
all Q that are the images of cubes under a smooth mapping. Even more generally, 
they hold for all domains O that satisfy a cone condition. For a proof, see Adams or 
Mazya. 

Since the spaces L q and W ktP are constructed by completing the space of smooth 
functions in the appropriate norms, it follows that if condition (23) is fulfilled, W m,/? 
is contained in Lfl • 

The normed linear spaces stiidied and used in analysis are 印益棚洛 )1. 

According to Cantor’s theory of sets, there is a gradation among infinites; the least 
of them are the countable sets. 


Definition. A normed linear space is called separable if it contains a countable set 
of points that is dense, namely, whose closure is the whole space. 

Most, but not all, spaces that are used in analysis are separable. Here is an impor¬ 
tant example that is not: 

(h) The space of all signed measures m on, say, the interval [0,1], of finite total 


define tl 




Denote bym y the unit mass located at the point Clearly, for v ^ z, \m = 
2. Since there are nondenumerably many points y in the interval [0, 1]，this shows 
that the space of measures is not separable. 


5.2 NONCOMPACTNESS OF THE UNIT BALL 

Many existence theorems in finite-dimensional spaces rest on the fact that the closed 
unit ball ， meaning that the set of points 

B\ = [x\ |.v| < l} t (24) 

is compact, that is to say, that any sequence of points in B\ has a convergent subse¬ 
quence. F. Riesz has shown that this property characterizes finite-dimensional spaces: 

Theorem 6. Let X be an infinite-dimensional normed linear space; then the unit 
hall B\ defined bv (24) is not comoact. 








It follows from (26) that 


1^1 < 2d. 






Clearly, (25) holds, and (25 7 ) follows from combining (27) and (28). □ 

Remark 1. Clearly, the number j on the right of (25^) can be replaced_bY_any_ — 
number < 1. 

We turn now to the proof of theorem 6 . We construct a sequence { 3 ^} of unit 
vectors recursively as follows: y\ is chosen arbitrarily. Suppose that )，i，• • • ， 办 -i 
have been chosen; denote by Y n the linear space spanned by them. Since Y n is finite¬ 
dimensional, it is closed; since X is infinite-dimensional, Y n is a proper subspace of 
X. So lemma 7 is applicable and a z with properties (25)，（25’）exists. We set 

yn ― 2 . 

Since yj ， j < n belongs to Y n 

\yn -yj\ > 5 , j < 

This shows that the distance of any two distinct yj exceeds Therefore no sub¬ 
sequence can form a Cauchy sequence. Since all yj belong to the unit ball B\ , it 
follows that B\ is not compact. □ 

Exercise 4. Prove that every finite-dimensional subspace of a normed linear space is 
closed. (Hint: Use the fact that all norms are equivalent on finite-dimensional spaces 
to show that every finite-dimensional subspace is complete.) 

Next we describe a kind of a substitute for the compactness that is lacking in the 
unit ball. 

Definition. A norm is called strictly subadditive if in (2) strict inequality holds ex¬ 
cept when x or y is a nonnegative multiple of the other. 

Exercise 5. Show that the sup norms of examples (a), (c)，(d), and (e) are not strictly 
subadditive. 

Exercise 6. Show that the norms in examples (b) and (f) are not strictly subadditive 
for p = 1 . 

All the norms in examples (b) and (f) are strictly subadditive when 1 < p < 00 . 
Furthermore for each of these norms the condition holds uniformly, in the following 
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For any pair of unit vectors x, y, the norm of{x-\-y)/2 is strictly less than 1 by an 
amount that depends only on \x — y\. More explicitly, there is an increasing function 
€(/•) defined for positive r, 

€(r) > 0, lim 6(r) = 0, (29) 

such that for all x, y in the unit ball \x\ < 1, \y\ < 1, the inequality 

. |^y^| < l-e(l^-^l). (30) 

holds. 




d, bounded sets K that are not compact, 
i to Clarkson, as is the result that the L p 

X). 

at in the space C, which is not uniformly 
even strictly subadditive~the conclusion 


mtinuous real-valued functions defined 














and similarly that 


min [z(r) - A(0] < -i, (36) 

0<f<l v y 

equality holding iff 

z{t) - k{t) s -1 forO < f < 1. (360 

Conditions (35’）and (36’）cannot both hold at f = 0, so it follows that in at least one 
of (35) or (36) inequality holds. This proves that 

U 一 ^Imax > 1 (37) 

for any k in K. On the other hand, one could choose ^ in AT so that the max and min 
in (35) and (36) are as close to 1 and 一 1 ， respectively, as one wishes. So 

= 1. (370 

This combined with (37) shows that there is no closest point to s in K. □ 

5.3 ISOMETRIES 

We turn now to isometries of a Banach space X onto itself, meaning mappings M of 
X onto X which preserve the distance of any pair of points: 

\M(x) - M(v)| = \x-y\ for all x, yin X. (38) 

Clearly, translations M(.v) = x^u,u fixed, are isometries. Also the isometries of 
X forma group. We want to investigate those isometries that map 0 into 0; all others 
can be obtained by composing these with a translation. 

Theorem 9. Let X be a linear space over the reals with a strictly subadditive norm. 
Let M be an isometric mapping ofX into itself that maps the origin into itself. Then 
'M~is"tinear.' 

Proof. Denote for simplicity M(.r) by x\ Take any pair of points x and y and 
define 

x -hy 

一一 — - . 5 ： = (39) 

Using isometry as stated in (38), and the definition of z, we have 

_ = = 

\z! - /I = ： \z-y\ = , (40) 









.Since by (40) they have the same norm, they must be equal: — V == 


2 ， =W. 


Deduce from (41) that M is linear. 


(41) 

□ 


: of life that some Banach spaces are very rich in isometries; others are 
inong the rich ones are the Hilbert spaces discussed in chapter 6; among 
；s are function spaces with the max norm. Here is an example due to 

f X the space of null sequences of complex-numbers 

x = {a n }, lim a n = 0 (42) 


\x\ = max \a n \. (42’) 


Show that'X is complete. 


be an arbitrary sequence of complex numbers of absolute value 1: 
efine the mapping U by 


Ua* = [bjjCln}. 


(43) 


a linear map of X onto X, and satisfies \\Jx\ == \x\; thus U is an isometry, 
a permutation of the positive integers. Define the map P by 

Pa* = l^}, o! n = % ⑻. （ 44) 


a linear map of X onto X, and an isometry. 


(defined by (42), (42，）is 
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Proof, Let itj be a yth unit vector, i.e. a vector whose jth component has absolute 
value 1, all others are zero. Denote by Tj the linear subspace of X consisting of all 
vectors whose jth component is zero. Clearly, 

Tj is closed and codim 7) = 1, (45) 

\uj + r| = 1 for all f in 7)，|f I £ 1. (46) 

Conversely, we have 

Lemma 11. Let u be a vector in X, \u\ = 1, and T a subspace of X of codimen¬ 
sion 1 so that (45) and (46) hold. Then u is a unit vector and T the corresponding 
subspace Tj. 

Proof. By definition (420 of the norm 

1 = |m| = \u m \ for some index m. 

It follows from (46) that no vector r in 7 can have an mth component ^ 0. Since 
T is assumed to have codimension i, it follows that T consists of all null sequences 
whose mth component is zero. From this it follows, by (46)，that all components of 
ii other than the mth must be zero. □ 

Let M be a linear isometry, of X onto X; let uj be any unit vector and Tj the 
corresponding subspace. Since M is linear, isometric, and onto, it follows that w) 
and Tp the image of iij and Tj under M, satisfy (45) and (46). Then by Lemma 1 1 ， 
u' is a unit vector; from this and the linearity of isometry theorem 10 follows readily. 
We conclude this chapter with the following result due to Mazur and Ulam: 

Theorem 12. Let X and X 1 be two normed linear spaces over ther reals, M an 
isometric mapping ofX onto X’ that carries 0 into 0. Then M is linear. 

Jhe.case. whereJhe.norms are strictly subadditive is covered in theorem 9. 
In the general case we take, as before, x and y to be any pair of points, and z their 
midpoint: 


又土乂 ■ 



As before，in (40), z is halfway between x and y, but when the norm in X is not 
strictly subadditive，this no longer characterizes the midpoint z. There may be other 
points u also halfway between x and y: 


\x - u\ = \y-u\ = — 2 (47) 












respect to the midpoint z. That 


v 


To see this, we note that •= a * + )’， an 

.. . d —.x =r y —-W, 

It follows from (47) that v is halfway be 

We define the diameter of -A as t 
of A: . 


= s 

M,Ui 

Since A is symmetric with respect to z, 

Of course, there may be other points p i 

\u-p\< \d A 

We denote the set of all such p by A\. 
to the midpoint z. That is, if p belongs I 

For using (48), we can write for any u ii 
q —u — 2z- 

We conclude from (51’) that | 分 — u\ = 
it follows from (50) that \u—q\< 士心 
It follows from (50) that the diamel 
of A: 


d Ai 

We now repeat this construction, obtai 
A 2 .. * ， each containing the midpoint z 
diameters satisfying 

d A„ + 

Clearly, d^ n tends to zero; it follows tl 
of the single point z. This 
metric structure of X. 


a nested seauence ot sets / 


characterizes 





Let M be an isometric mapping of X onto X f . Then the inverse of M maps X f 
isometrically onto X. Denote by x f and y f the images of x, y under M, and denote 
by A’, .... A f n the sets analogous.!:。the ones defined in X. Recall that the set A 

was defined by (47), and that the set A j was defined by (49) and (50). Since these 
inequalities refer only to distances, and since M is an isometry, it follows that M 
maps every point of A n into A l lv Since the inverse of M is isometric, it maps every 
point of A f n into A n . Thus M maps A n onto A\ v and thus the intersection of the A n 
onto the intersection of the A f n . Since these intersections are ， respectively, (x + y)/2 
and (x f + /)/2, it follows that 

M ( 宁 ) = 午 . (53) 

Setting y = 0, and using the assumption that M(0) = y f = 0, we get that 
M{x/2) = x’/2. Applying this to equation (53), we get 

M(x ^y)= x f + / = MU) + M(y). 

This is the first property of linearity, see equation (1) in chapter 2. From this we de¬ 
duce that M{kx) = kM(x) for all rational k. Since M is an isometry, it is continuous, 
and so the relation holds for all real k. □ 
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HILBERT SPACE 


6.1 SCALAR PRODUCT 


A scalar product in a linear space X over R is a real valued function of two pointj 
and y in X, denoted as (x, y), having the following properties: 


(i) Bilinearity, For fixed y, (x, }0 is a linear function of x, for fixed x a linear 
function of 


(ii) Symmetry, (y,x) = (x, y). 

(iii) Positivity, (x,x)>0forx^Q. 


When the field of scalars is C, (x, y) is complex valued, and properties (i) and 
(ii) are altered as follows: 


(i) Sesquilinearity. For fixed y, (1) is a linear function of x and for^: fixed (1) is a 



(ax, y) = a(x, )0, (x, ay) = a{x, )0- (1) 

(ii) Skew symmettj, 


{y,x) = (x, y). (2) 

Given a scalar product, we can define a norm, denoted by || ||, as follows: 


\\x\\ = {x,x) ] l 2 . ⑶ 


We claim that || || has the obvious properties of a norm: 

Positivity follows from (iii), and homogeneity from (1 )• To show subadditivity we 
need 


Theorem 1 (Schwarz Inequality). A scalar product satisfying (i), (ii), and (iii) sat¬ 
isfies 
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Replacing x by ax 
equality holds in 







% 

54 HILBERT SPACE 

scalar product is a continuous function of its factors; therefore it can be extended to 
—the-completed space. Thus the completion is a Hilbert space. 

We give some examples of linear spaces with inner product: 

Example L The space of continuous functions .v(r) on the interval [0,1]，with 



This space is incomplete. 

Example 2. The space i 2 of vectors with infinitely many components: 


x = (a\,a 2 ,...)» y = (b \ ，…） 

subject-to the restriction -- 

^2\aj\ 2 < oo, El 〜 -| 2 < 00 . 

We define the scalar product as 

(u) = 

Exercise 3. Show that £ 2 is complete. 

Example 3. The space L 2 of all functions square integrable according to Lebesgue 
on some domain in R n . This space is complete. 

Many other examples will come up in the applications presented in subsequent 



6.2 CLOSEST POINT IN A CLOSED CONVEX SUBSET 


Theorem 2. Given a nonempty closed, convex subset K of a Hilbert space H t and 




he parallelogram identity (6) to x = Cr — y n )/2, y =： (x — y m )/2: 


I 卜 — yn + ym I + i||y n - _y m || 2 = I(cf2 +df n ). (9) 

Since K is convex, (y n +y m )/2 belongs to K, and so by (7), ||jc - {y n +ym)m > d. 
Using this and (8) in (9), we deduce that y n is a Cauchy sequence. Since H is com¬ 
plete and K closed, y = limy« belongs to K. Since \\x —y\\ = lim \\x — y n || =d,y 
minimizes the distance from x. That there is only one minimizer follows from (6): 
suppose that y f is another minimum, and apply (6) to x — y,x — y\ □ 

Theorem 2 is a special case of theorem 8 in chapter 5. 

Definition. Let K be a linear subspace; its orthogonal complement consists of all 
vectors v orthogonal to Y, that is, satisfying (u, y) = 0. It is denoted as 7 丄. 

Theorem 3. Let H be a Hilbert space，Y a closed subspace of H, the orthogonal 
complement of Y• We claim that 

(i) 7 丄 a closed linear subspace of H ; 

(ii) Y and F 丄 are complementary subspaces, meaning that every x can be decom¬ 
posed uniquely as a sum of a vector in Y and in y 丄； 

(iii) (r 丄)丄 =y. 

Proof. It follows from the bilinearity of scalar product that the set of vectors v 
orthogonal to all vectors of any set Y form a linear space. This shows that F 1 * is a 
linear space. Let {uy} be a convergent sequence of elements of F 丄： 

lim Vj = v. (10) 

We claim that v belongs to Y L , namely^ that 


(v, 4) = 0 when z is in Y. 

Since Vj belongs to 


^ = (u - Vj,z). 

By the Schwarz inequality applied on the right, 

l(v,z)| < ||v~ vy|| 

by (10), ||v — yy|| tends to zero. So (11) shows tl 
closed, as asserted in (i). 










56 HILBERT SPACE 

We turn now to (ii). Given anyJc in H, there is, according to theorem 2, a vector 
in Y closest to x. Set 


( 12 ) 

The minimum property of y means that for any z in Y and any real77 - - 

—— : -- : __ ^ _ : _ 

Using (5), we can rewrite the right side as ||u|| 2 + 2f Re(u, z) + r 2 ||z|| 2 and conclude 
that 


Re(u, z) = 0 for all z in F. 


(13) 


v belongs to l 7 丄； （12 ) gives the decomposition of x as y + v, the 

from y pnH nnp frrvm V-*- 


This shows that 

sum of a vector from Y and one from 7 X . 

This decomposition is unique, for if x = j + i; = -f v\ then y 一 ）/ = t /一 u 
would belong to both Y and Y 1 - and thus would be orthogonal to itself. But then, 
by positivity, y — y ! = i /一 i/ = 0. Thus (ii) is proved. Part (iii) is an immediate 
consequence of (ii). □ 


COMMENT. It follows from theorem 3 that every closed linear subspace of a Hilbert 
space has a closed complement. This is not true for all Banach spaces; examples will 
be given later. 


6.3 LINEAR FUNCTIONALS 


A Hilbert space comes equipped with a whole set of built-in linear functionals. For 
y fixed, (x, y) = a Zmea/-of A%-that is, a /rnecr mapping of H 

into C. Furthermore, according to the Schwarz inequality (4), i(x) is bounded by a 
constant multiple of ||x||. It turns out that conversely: 


Theorem 4. Let i{x) be a linear functional on a Hilbert space H that is bounded: 

\e(x)\ < const. \\x\\. (14) 

Then i is of the form 

l(x) = (x, y), y in H. (15) 

The point y is uniquely determined. 

Proof. We will use the following facts: 







(ii) If two linear functionals l and m have the same nullspace, they are constant 
multiples of each other: 


l = cm. (16) 

(iii) The nullspace of a linear functional that is bounded in the sense of (14) is a 
closed subspace. 

Exercise 4. Prove lemma 5. 


space. 

Suppose now that £ is _ 0. Then its nullspace is a closed subspace F of W of 
codimension 1. Its orthogonal complement 7 丄 (see theorem 3) is one dimensional. 
Let p be any nonzero vector in Y 1 -, and define the linear functional m by 

m{x) = {x, p). 

Clearly, the nullspace of m is Y. So by (16) of part (ii) of lemma 5, 

i(x) = cm(x) = {x,cp). □ 

Theorem 4 is called the Riesz-Frechet representation theorem. 

The following useful generalization has been given by Milgram and Lax: 

Theorem 6 (Lax-Milgram Lemma). Let H be a Hilbert space, and B(x, y) a 
function of two vectors with the following properties: 

(i) B{x, y) is for fixed y a linear function ofx, for fixed x a skewlinear function 

ofy- 一 ， _ _ _ 

(ii) B is bounded: there is a constant c so that for all x and y in R 


\B(x, y)|<c||.r|| ||y||. (17) 



TeOvy)i>^lfyll 2 ^^ — 而 


for all y in H. 

Assertion. Every linear functional Ion H that is bounded in the sense of (14) is of 
the form " 


l(x) = B(x ， y) t y a uniquely determined vector in H. 


( 19 ) 
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— Proof. By (i) and (ii), for y fixed B(x' y) is a bounded linear functional of a*. 
Therefore by theorem 4 it can be writtefTas~ : 

B(x, y) == (x, z), z inH. (20) 

Since z is uniquely determined by y, it is a function of y. It follows from (20) that 
the relation of z to y is Zznear; it follows from4his that-the-set^f^appearing4n <-20)^ 

..as._>Ltakes.on ali values-in,.i/_js a of yy , We._c 拉 it ^ a closed _ 

linear subspace. To see this, set x = y in (20): 

y) = Cv, z). (20') 

Using (18) on the left and the Schwarz inequality on the right, we get after dividing 
by ILvIl that 。 

b\\y\\ < IUI|. (21) 

Let {z/ t } be a sequence of vectors appearing in (20), with corresponding y n : 

B{x,y n ) = (x,z w ). (22) 

Subtraction and skew linearity gives B(x,y n -y m ) = (x,z tl - z m ). By (21)，&||)，" 一 
.Vmll < \\z n - Zmll. From this it follows that if z n converges to z, the corresponding 
yn form a Cauchy sequence ： Since H is complete, the sequence {j n } converges to 
a limit y. It follows from (17) that the left side of (22) converges to B(x, y), and it 
follows from (4) that the right side converges to (a*, z). So 

y) = (x, z), 

which proves that the set of ^ appearing in (20) form a closed subspace of H. 

We claim that this closed subspace is all of H ; for if not，then according to theo¬ 
rem 3 there would be a nonzero vector x orthogonal to all z. It follows from (20) that 
such an x satisfies B(x, y) = 0 for all y. Setting y = x gives B(x,x) = 0; using 
(38), we get ||x|| =0, contrary to jc ^ 0. 

According to theorem 4, all linear functionals i(x) can be represented as (jc, c), 

2 in H. Combined with (20); this establishes (19). It follows from (18) that y is 
uniquely determined. 匚 1 


6.4 LINEAR SPAN 

We recall from chapter 1 that the linear span of a collection of points {yj} = S is 
the smallest linear subspace containing them. The closed linear span of a collection 
of points 5 in a Hilbert space H is defined to be the smallest closed linear subspacc 
containing S, that is, the intersection of all such subspaces. 

Exercise 5. Show that the closed linear span of a set is the closure of its linear span. 











Theorem 7. The point y of a Hilbert space H belongs to the closed linear span 
of the set {^y} iff every vector z that is orthogonal to all yj is orthogonal also to y 

(v. z= d for all 7 . that satisfy (v ；. 7 .) = 0 for all i. (2 
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Lemma 8. Let H denote a Hilbert space, {^} an orthonormal set in H, The closed 
~Unearspan~of{xj}~consists-ofuttveaorsofTheform- 


x = J2 a J x j< 

where the a j are complex numbers so chosen that. 

(27) 

\aj P < oo. 

m '、 

The sum (27) converges in the sense of the Hilbert spaa 

e 'nonn. Furthermore 

IWI 2 = mK；| 2 , 

(28) 

and 4 


a j = Xj). 

(280 


Exercise 6. Prove lemma 8. 

Theorem 9. Every Hilbert space contains an orthonormal basis. 


P roo f. Consider all orthonormal sets, partially ordered by inclusion. Given a to- 
mlly ordered collection, the union of all vectors contained in the sets in tihe collection 
includes all of them. Therefore, by Zorn’s lemma, there is an orthonormal set that is 
maximal. We claim that the closed linear span Z of a maximal orthonormal set [xj] 
is the whole space. We argue indirectly: suppose that there is a )，that does not belong 
to the closed linear span X. Define aj by 



(29) 

We claim that Bessel's inequality holds: 



(30) 

for consider 


\\y-J2 a j^j\\ Z > 

(31) 




where F means a finite collection of j. Using the orthonormality of {x/}, we find 
that (31) equals 

- [ 吵，々 ) -XX.(w) + 

F F f 

which by (29) equals 


||y|| 2 -^|fly| 2 . 
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Exercise 10. Show that every infinite-dimensional separable Hilbert space is isb- 
.marphic with_the_space € 2 consisting of all vectors with infinitely many components: 
X = (q， <32, . • •)， subject to the restriction ||x|| 2 = J2 \ a j\ 2 K °°* 


NOTES. The abstract notion of Hilbert space described in this chapter is due to von 
Neumann in 1929. Earlier, Hilbert and his school had used the concrete spaces-de- 
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APPLICATIONS OF HILBERT 
SPACE RESULTS 


7.1 RADON-NIKODYM THEOREM 


Let v and fj. be finite nonnegative measures on the same a-algebra, v is said to be 
absolutely continuous with respect to \x if every set that has /^-measure zero has v- 
measure zero. The Radon-Nikodym theorem asserts that such a v-measure can be 
expressed as 


v(E) = (I) 

where g is a nonnegative integrable function with respect to \x. 

Von Neumann showed how to derive this from the Riesz representation theorem 
for linear functionals in Hilbert space: 

Let H be the real Hilbert space Lr{ix + y), with the norm 

\\xf = J X 2 d(ix + v). (2) 

Assume, for simplicity, that the [x and D measure_of the whole space is finite; then 
it follows, via the Schwarz inequality, that every square integrable function is inte¬ 
grable. The linear functional 

- - g ( . v ) = -(—斗 

is bounded with respect to the L 2 (/i)-norm, so even more with respect to the L 2 (从 + 
u)-norm. Then, by theorem 4 of chapter 6, l(x) can be represented as a scalar product 
(U) for some y in L 2 (/i + v): 



Zxydljr+'v); 




64 APPLICATIONS OF HILBERT SPACE RESULTS 

y depends only on the measures fi and v. We rewrite this as 

Jx(l - y)d/i = Jxydv. (4) 


We claim that 


广一:; —:一~一— 


except for a set of /x-measure zero. To show this, we denote by F the set on which 
y <0, and claim that 

' '~njF)^,oyw 

Set x = 1 on F, x = 0 off F; with this choice, (4) becomes 

J^(l -y)dfi — j^ydv. (7) 

Since y < 0 on F, the right side of (7) is < 0, while the left side is > this 
proves (6). 

Denote by G the set where )，> 1; suppose that fi(G) > 0. Set x = 1 on G, x = 0 
off G; with this choice (4) becomes 

f f ydv. (8) 

Jg Jg 

Since y > 1 on G, the left side of (8) is negative, and the right side is positive, a 
contradiction. This completes the proof of (5). 

We modify, if necessary, the function y on a set of ^-measure 0 so that (5) holds 
everywhere. Since v is absolutely continuous with respect to fx, this does not af¬ 
fect (4). 

We claim that the function g in (l)is given by g = (1 — y)/y.To see this, denote 
u = xy, and rewrite (4) as 


J ugdfi = 


J udv - 


⑼ 


Let E be any measurable set; we choose x so that u is 1 on 五 ， 0 off E. Then (9) gives 


gdfx= ： v(E). 


( 10 ) 


This is relation (1). 


□ 


Exercise L Prove the Radon-Nikodym theorem for measures that are only a-finite. 









7.2 DIRICHLET，S PROBLEM 

First, let D be a bounded domain in Denote by Cq°(D) the space of real-valued 
infinitely differentiable functions / whose support is contained in a compact subset 
of D. On the space Cq°(Z)) we introduce two scalar products: 

8dx and (f ， g)\ = dx, (11) 

where fj = df/dxj ， j = 1, 

Exercise 2. Verify that Cq°(£>) is an inner product space under each of these scalar 
products. 


The following inequality connecting these two norms is due to Zaremba: 

Lemma 1. For all f in Cq°(D), 

ll/iio<rfil/iil ， (12) 

where d is the width of D. 

Proof. Since / is zero on the boundary of D, at any point x in D, 

m = f x a d Xl 


where x b is a boundary point of D with the same .., x n coordinate as x. Apply¬ 
ing the Schwarz inequality above gives 


f 2 M < d-J \fl\ 2 d Xl . 


Integrating this over D gives (12). □ 

- Denote 七 y • 丑 力 (®)_withTespect to the nomr || j| j, by — 

completion With rCSpGCt to th&* nOFm-|j-jJ 0 •--:. 、二、 -- 二 '= _ =="- ;? ~ ? =- ，二 ITS :-.. —， 

Lemma 2. Every element v of belongs to Hq, and has partial derivatives Vj of 
first order that belong to Hq; these partial derivatives satisfy 


(z t Vj)o = -(dz/dxj t v)o~ 


tl'3)- 


for any Cq° function z. Furthermore formula (11) holds for f、g in Hy 
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Proof. Let {u(”)} be a sequence of Cq° functions that tends to u in the || || i norm. 
That means that the first derivatives converge in the ||||o norm; we cairth'eSB 
limits Vj. By lemma 1, {u(")} converges in the || ||o to a limit in Hq 9 which we identify 
with the limit v in /f】 0 . Integration by parts gives relation (13) for i; ⑻ in place of u; 
letting /2 - » oo gives (13); relation (11) follows similarly. □ 


We claim that the identification of elements v in with elements of Ho is'one-""' 
to-one, that is, ai'embedding of if j 0 in u is zero in Hq, 

then it is zero in H^. Clearly, it follows from (13) that if i; = 0 in Hq, then Vj =0 in 
Ho for all j. This makes v = Oin H^. 

Relation (13) asserts that vj are the first partial derivatives of u.in .the sense_of_. 
distributions (see Appendix B). ， 

Let / be any element of Hq ； define the linear functional t by 


m = («, /) 0 * (i4) 

By the Schwarz inequality, and inequality (12) of lemma 1, 

l««)l<!l/lioll«llo<^l!/lloll«lii (15) 

for all u in Hf. According to the Riesz-Frechet representation theorem, theorem 4 of 
chapter 6, the functional (14) can be represented as an inner product. That is, there 
exists v in Hf such that 

(U, f)o = (u,vh (16) 

for all u in H^. By definition (11) and lemma 2, with My = du/dxj, 

(u,v)] v 7 )o. (17) 

Take now u to be Cg?. We can, using the theory of distributions, rewrite the^right 
side of (17) as 

一 X ] (“， U J7)0 = 一 ( w ，△"()， （170 

where Vjj are the second partial derivatives of u, and A the Laplace operator, acting 
in the sense of distributions. Combining (17)，（170, and (16)，we deduce that 


(«, f)o = —(« ， Au)o 

for all u in Cq°. From this it follows that in the sense of distribution theory, 

/ = -Au. (18) 

Thus i; is a distribution solution of the inhomogeneous equation (18). 

Next we show that by virtue of belonging to v(x) tends to zero in an average 
sense as x tends to the boundary of D. The precise statement is lemma 3: 









Clearly, B is bilinear; it follows from lemma 1 that it is bounded. Estimating the 
middle term by the Schwarz inequality, we see that B is positive in the sense of 









(M, /)o = B(u, v) 


(23) 


by parts on the right in (23), we deduce that 

AV, ..…：. .._ (24) 

s. • 

r method for solving the Dirichlet problem for the homo- 


Au = 0 in D (25) 

ary is prescribed. This method exploits some special prop- 
ms and yields genuine solutions that satisfy the boundary 
use. We assume that dD is once differentiable, and that the 
also once differentiable. We can then construct a C 1 func- 
as the prescribed value on dD, and we state the boundary 


- u = / on 3D. (26) 

iary value problem (25)，(26): decompose / as 

f = v + z 1 (27) 

! z vanishes on 3D. When v and z have continuous first 
ie boundary, we can apply Green’s formula: 

f f dv 

ViZjdxdy = — I (Av)zdxdy + I —zds. 

JD JdD dn 

in D, and by (26), z = 0 on 3Z)，the right side = 0. In 
onic functions and the space of Junctions that vanish on the 
to each other in the ( ， )i scalar product 
position (27) thus appears as the task of splitting / into the 

m two orthogonal function spaces. We show how this can 

aling to theorem 3 of chapter 6. The scalar product defined 
D) is not positive because (/， f)\ =0, not only for / = 0 
ons /. We overcome this slight blemish by considering two 
they differ by a constant. 

ipletion in the || || \ norm of all C 1 functions on DU dD; the 
space of H\ . We apply now the orthogonal decomposition 
apter 6, to conclude that every / in H\ can be decomposed 














uniquely as in (27), where z in u 丄 //f The condition i; 丄 //{) asserts that 

(“ ， i;)l = = 0 

for all u in ^ 0 . Taking u to be Cg°, we can integrate by parts in the sense of distri¬ 
butions to get 

0 =[(〜•， Vj)o = ~ [](“，Vjjh = -(w，Av) 0 , 
which implies that 

Av = 0 

in the sense of distributions. It is a well-known result of Hermann Weyl that a func¬ 
tion harmonic in the distribution sense is harmonic in the classical sense; a proof is 
provided in section 4 of Appendix B. 

We claim that when / is continuous up to the boundary, z{q) tends to zero as 
q approaches the boundary. Denote by d the distance of ^ to 9 D, and let C be the 
circular disc with center q and radius r = d/2 and denote mean values over C by 
bars: taking the mean value of (27), we obtain 

fiq) = z{q)- {- v{q). (28) 

Since / is continuous up to the boundary, its mean value on C differs from its value 
at the center ^ of C by an amount w that tends to zero as d tends to zero. According 
to the elementary theory of partial differential equations, the mean value of the har¬ 
monic function y ona circular disc C is equal to its value at the center of C. So (28) 
can be rewritten as 

f(q) = z(q) + v(q). 

Subtracting (27) from this, we conclude that 

w = z(q)-z(q). (29) 

We appeal now to lemma 3, according to which the mean value over R{p,d) of the 
absolute value of a function z in tends to zero as the distance dofqtodD tends 
to zero. It follows that the mean value of |z| over the disc C tends to zero. Combined 

- wit h -(2 9 ) y _ w e-c an c l _ ud e4hat-^)4tsel£4:~eads 4Q ■ 雜 F o .. as 哥 appFeaehefr-the-beandary： — 

Using.X22)^ejseeJmLthaii.amomafunction,i?4S c.on ... 

——its boundary value equals /• Thus we succeeded in constructing not a generalized, 
biit a genuine solution of the Dirichlet boundary value problem. 

Here is another way of looking at (27). We saw that the boundary value problem 
(25)，(26) for the Laplace equation amounts to decomposing / as a sum of a har¬ 
monic function and_of one. vanisfaing„oiLlhe 上 oundary—We-shaweci-thaU.lies^spaces— 
are orthogonal to each other in the ( , )i scalar product. The decomposition was 
accomplished by appealing to the orthogonal decomposition theorem, theorem 3 of 
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chapter 6. Here we show how to perform an orthogonal decomposition with respect 
otht-subspace V consisting of harmonic functions whose first derivatives are square 
ntegrable in D. It is an easy fact in the theory of harmonic functions that V is com¬ 
pete in the ||||i-norm. So, according to theorem 3 of chapter 6, we can decompose 
my / in H\ as 

(30) 


vhere v is in the space V of harmonic functions with square~fntegrable ffrst den^~ 
ives，and z is orthogonal to V . Our aim is'to show that z vanishes on the boundary. 












As approaches the boundary point b, so does the point q. Therefore the integrant 
on the right in (33) tends to zero uniformly at all points of D whose distance fron 
b exceeds any positive quantity r. It can be shown (see Lax for details) that also thi 










厂 k 1 1 '{ ^ v** ^ r %;\ ''K c ,：^, *' 0’ s dv ， 


-B i 糾 wT^ feMiufii r N 










Since v — h is harmonic, and u = 0 on aD, v — h solves the boundary value problem 

(25), (26). ... 

The preceding proof is a reworking of an argument of Garabedian and Schiffer. 
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8.1 BOUNDED LINEAR FUNCTIONALS 

In this chapter we deal with normed linear spaces X over the real or the complex 
numbers. We will study linear functionals, namely mappings l of X into K or C 
satisfying 


Uax) = atixl £(jc-h 3 0 = 你 ) + 名 0 ，)， ⑴ 

that are in addition continuous. They are continuous in that they satisfy 

lim i(x n ) = i{x) when lim \x n — x\ = 0. (2) 

n-yoo 

Definition. The collection of all continuous linear functionals is called the dual of 
X. It is denoted by X f . 

Clearly, the sum and constant multiple of continuous linear functionals is contin¬ 
uous and linear; thus X f is a linear space. 

Definition. A linear functional ^ on X is called bounded if there is a positive number 
c such that 


Wx)\<c\x\ for all x in X, (3) 

where | | on the left denotes the absolute value. 


Theorem 1. A linear functional £ on X is continuous if and only if it is bounded. 
Proof. Set x n — x = y n in (2); using (1) and (3)，we get 


this shows that boundedness implies continuity. 




Suppose that l is not bounded; then for any choice of c = n, (2) is violated by 
some x n : 

> n\x n l 

Clearly, x n can be replaced by any multiple of if we normalize x n so that 

\Xn \ = 一 7 =， 
y/n 

then x n 0 but t{x n ) oo. This shows that lack of boundedness implies lack of 
continuity. □ 

Theorem 2. The nullspace of a bounded linear functional l on a normed linear 
space is a closed linear subspace. For t nontrivial, meaning ^ 0, the nullspace has 
codimension L 





Theorem 3. The dual X r of any normed linear space is a complete normed linear 
space under the norm defined by (4). 


Proof. Homogeneity and positivity are obvious. For subadditivity, consider two 
bounded linear functionals i and m: ' 


|€ + m| = sup |(€4- m)(x)\ < sup (|^(jc)| + \m(x)\) 


|.v|=l |.v|=l 

< sup \t(x)\ + sup |m(.r)| = \t\ + \m\. 



We show now completeness: let {£ m }.be,a Cauchy..sequence.in X / :_ 

14 一 I 0 as n,m oo. (5) 

According to definition (4) of the norm for functionals and (5 )， _.„ 


\(in - 4i)W 卜 14 W 一 < 14 - 《ml W — 0 as n，m — oo, 





for every x in Z. Since the field of scalars, JR orC, is complete, 

lim 4 (x) = i(x) -- 

；?—>00 

exists. It is easy to show that t(x) is linear and bounded, and it is not hard to deduce 
from (5) that if \l n - i m \ < € for m > n, then also \i n - i\ < 6 . Therefore 

Inn |4 二々 = 0. 一 . —_ □ 


8.2 EXTENSION OF BOUNDED LINEAR FUNCTIONALS 


So far we have not shown the existence of a single linear functional except -g = 0. 
Certainly there are lots of them in a Hilbert space; we show now that there are just as 
many in a Banach space. The tool needed is the Hahn-Banach theorem, specialized 
to the case 


p{x)^c\x\. 


Theorem 4. Let X be a nonned linear space over the real or complex numbers, Y a 
subspace，and t a linear functional defined on Y and bounded there: 

Wy)\ < c\yl y in Y. 

Then l can be extended as a bounded linear functional to all ofX so that its bound 
on X equals its bound on Y. 


This theorem is a special case of theorem 8 of chapter 3. We give now some 
applications. 

Theorem 5. Say that )，]，•.•，），" are N linearly independent vectors in a nonned 
linear space X ， a\ ，…， aN arbitrary complex numbers. Then there exists a bounded 
linear functional i such that 

i{yj) = ay, j = 1,N. ( 6 ) 

Proof. Denote by Y the linear space spanned by 3 ^ ， • • • ， y m \ it consists of vectors 
of the form 

y = J2 b jyj- 

Since the yj are linearly independent, this representation of y is unique. Now define 
£ on 7 by 

^OO = Yl b J a r 

Clearly, t is linear and bounded on Y, and satisfies ( 6 ); by theorem 4, it can be 
boundedly extended to all of X. 口 
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Corollary Every finite-dimensional subspace Y of a nonncJ linear space X has 

a closed complement 

Proof. Choose a basis ^ 1 ，...,in Y. According to theorem 5, there exist N 
bounded linear functionals tj, j = U … ， N ， such that 

according to theorem 2, the nullspace Zy of is closed. So then is their intersection 

z = Zi n... n Z/V. 





|£(z)| < inf |z- 3 >| = m(z). 
yin y 

It follows from this and the definition (11) of M(z) that 

一一 —.： ... . _ 

To show equality, we look at the linear space yo consisting of all vectors of the fc 
y + az t yinY,a complex, and define on Yq the linear functional £.q ： 


^o(y + ^z) = am (z). 

By definition (9) of m, it follows that £q is bounded on Yq by 1; so by theorem A 
can be extended to all of X so that \in\ = 1. Set v = 0, a = 1 in (13): 












MARK 1. In case Y is the trivial subspace consisting of {0), theorem 7 reduces 
orem 6. 

Theorem 7 is an example of dual variational problems: a pair of a minimum a 
laximum problem whose extreme values are equal. 




•• Show that the 
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Proof. For any m in 7 丄， and any y in y with |夕| = 1， 

l^)l = l(«-m)(3；)|<|^m|. (16) 

It follows that |€|y is < the right side of (15). 

According to theorem 4, the restriction of £ to 7 has an extension to X, call it 如， 
whose norm on X equals its norm on Y: 

141 = Wr- (17) 

Since Iq and i are equal on 7, € ~ m belongs to 7 1 ; furthermore by (17), 

\^m\ = \l 0 \ = \Z\ Y . (170 

This combined with (16) proves that equality holds in (15). □ 

Theorem 7’ is another example of dual variational problems. 

Exercise 3. Show that Y r is isometrically isomorphic with X 1 /Y 1 -. 

Definition. The closed linear span of a subset [yj) of a normed linear space is the 
smallest closed linear space containing all yj, that is, the intersection of all closed 
linear spaces containing all yj* 

Exercise 4. Show that the closed linear span of [yj] is the closure of the linear span 

Y of [yj], consisting of all finite linear combinations of the yj ： 

y = Y, a jyj' (is) 

F 

The following result, called the spanning criterion, is one of the workhorses of 
functional analysis. 

Theorem 8 ...A point z of a nonned linear space X belongs to the closed linear span 

Y of a subset {^} of X iff every bounded linear functional i that vanishes on the 
subset vanishes at z ： that is, 

_ -JpraUy_j_ .——_ — —(19).__ 

implies that^{z) = 0. 

Proof. Since t is linear, (19) implies that t{y) = 0 for y of form (18); since i is 
continuous, it vanishes on all limits of points of form (18). Conversely, suppose that 
z does not belong to the closed linear span Y of { 3 ^}; then 

inf — : y| = d > 0 . ( 20 ) 

yinY 
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( 21 ) 


Define the subspace Z to consist of all points of the forr 
y + ciz, y in K, 

and define on Z the linear functional £q by 

^o(y -\raz) = a. 

It follows from (20) that 

\y + az\ >d\a\. 

Combining this with the definition of£o» wei 
So by theorem 4, £ 0 can be extended boundedly to all of X. By definition, 


4 () 7 ) = 0 for all yy, £ 0 ⑵ =1. □ 

N0TE - Theorem 8 is a generalization to Banach spaces of theorem 7 in chapter 6 on 
Hilbert spaces. 


8.3 REFLEXIVE SPACES 

The dual X’ of a normed linear space has its own dual, denoted as X n . Since l(x) is 
a bilinear function of t and x, and is bounded, definition (4), it follows that, for fixed 
a% i{x) is a bounded linear functional of L It follows from theorem 6 that the norm 
of this linear functional is |x|. Thus the space X is, in this natural way, isometrically 
embedded in X". It is a basic result of the theory of finite dimensional vector spaces 
that X n = X. This is no longer true for all Banach spaces. 

Definition. A Banach space is called reflexive if X n = X, that is, if X is all of X n . 

Theorem 9. Every Hilbert space is reflexive. 

Proo f. This is an immediate consequence of theorem 4 in chapter 6. □ 

The following result is due to Milman: 

Theorem 10. A uniformly convex Banach space is reflexive. 

Fnr nrri 0 f we refer to Milman. 

ed in chapter 5 that the L p spaces, 1 < /? < oo, are uniformly convex, 
j this result of Clarkson’s with the preceding result of Milman, we con- 
L p , 1 < p < oo, are reflexive. 
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A* a constant. Setting /= fk in (22) gives 

Ufk) = j fk^dm = J \ul^ l \\u\dni > j \uk\ q dm. 

On the other hand, 

II/a*II^ = j = j \uk\ q dm. 

Since，by assumption, \t(fk) [- < c\f k \ p , the last two inequalities imply that 


J \u k \Um<c (^J \u k \^dn^j IP . 

Dividing both sides by the right side gives 

\Mq < c. 

Letting k oo we deduce thatw is in L q . This completes the proof. 


□ 


Exercise 5. Show that if the total measure equals 1, then ||/j|pisan increasing func¬ 
tion of p. 


Theorem 12. C[—1,1], normed by the maximum norm, is not reflexive. 


Proof. If it were, C would be the dual of C l . According to theorem 6 applied to 
X = C\ for every i in C f there is an / in C" = C such that 

1 名 1 = £(/)， l/lmax = 1* (23) 


Now define 


i(8) 



g(t) dt. 


Clearly, for every g in C [一 1 ， 1 ]， 


W8)\ < 2\ 8 \, 

but given any € > 0, we can choose g so that 


(230 


剛 >(2 — 侧 max . 


This result shows that \l\ = 2. Along with (23，)，it contradicts (23) for g =/. □ 

Theorem 13. Let Z be a normed linear space over C. If Z! is separable, so is Z. 






The conclusion of theorem 12 is applicable to the space C(Q), Q any Hausdorff 
space containing more than a discrete set of points. This is the precise state of affairs: 
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Theorem 14. Ler Q be a compact Hausdolff space, C(Q) the space 
real-valued functions on Q, nonned by the max norm. 

(i) C / consists of all signed measures m of finite total mass，defined overall Borel 
sets. That is, every bounded linear functional i on C(Q) can be written as 

认 f)= L fdm . (2?) 



W= f \dm\. (28) 

Jq 

The measure m is uniquely determined by L 
(ii) C n is L°°{Q) t the space of all bounded, Borel-measurable functions on Q. 

The prototype of this basic result is due to F. Riesz; the general result is due to 
Kakutani. A functional analytic proof for Q metric is supplied in Appendix A. 

NOTE. Theorem 14 is emphatically false when Q is not compact and C(Q) is the 
space of all bounded continuous functions on O, normed by the sup norm. Here is 
what happens: 

Take Q to be the real line K，{/々} a sequence of points -> oo. We take Y to be the 
subspace of C(M) consisting of all functions / for which 

lim/(4) = /oo 

exists. For f in Y we define the functional £ by 
^(/) = /oo* 

Clearly, i is linear and i is bounded on Y: \i\y < 1 . By the Hahn-Banach theorem, 
l can be extended to all of C(R) as a bounded linear functional. 

We claim that this i cannot be of the form (27). If it were，the value of €(/) would 
depend on values of / on any compact interval I on which 

But clearly, we can alter the values of / in y on / without changing the value of 
£(/); so there cannot be such a dependence. 

The following result is of some interest: 


Theorem 15. A closed linear subspace Y of a reflexive Banach space X is reflexive. 





Proof. Every bounded linear functional i on X, when restricted to Y, becomes 
a bounded linear functional on 7; we denote this functional by Since by Hahn- 
Banach every bounded linear functional on Y can be extended to X, this restriction 
map i 


f f 

maps X f onto Y\ The restriction map induces the following mapping from Y" to X f, \ 
For any rj in Y" we define ( in X" by setting, for any l in X\ 

m = 咐 0 )， (29) 

where £q is the restriction of l to Y. Since X is reflexive, f can be identified with an 
element z of X: 


setting this into (29) gives 

i(z) = 咐 o)* 


(290 


We claim that z belongs to Y. To show this, we note that if l belongs to meaning 
it vanishes on Y, then to = 0, and so by (290, Uz) = 0. We appeal now to theorem 
8 to conclude that ^ belongs to the closure of Y. But since Y is closed, z belongs to 
F. So we can rewrite (29’）as 


to(z) = _)• (30) 

Since every functional in Y' occurs as Iq, (30) shows that every r] in Y n can be 
identified with some z in Y. □ 


8.4 SUPPORT FUNCTION OF A SfiT 

We recall from chapter 1 the notion of the convex hull of a pointset M in a linear 
space X over the reals as the smallest convex set in X containing M, that is, the 
- conyex_sets_r h ai_c:ont:ai n The-Convex-hulLof. M 」 s. denoted. 

一：， by” 脱 ’ _...... 

As remarked in theorem 6 of chapter 1, M consists of all convex combinations of 


points of M. These are points of the form 


x = y^ i a j x ji in M 

(31) 

a i ^ 0 - Y^ a J = L 

(310 










convex 
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smallest closed convex set containing M, that is the intersection of all closed convex 
sets containing M. We denote this set as M. 

Exercise 6. Show that the closed convex hull of M is the closure of the convex hull 


_ Definition, For any bounded subset M of a normed linear space_X over R,.we define 

the support function S 似 as the following function on X f : 

%W= sup £(y). (32) 

. .. . y in M 

Theorem 16. Support functions have the following properties: 

(i) Subadditivity, for all t, m 'in X\ -f m) < Sm^X) + 

⑻ %( 0 )= 0 . 

(iii) Positive homogeneity, Sj^(^) = aS^i^for a > 0. 

(iv) Monotonicity, for M C N t 

(v) Additivity, Sm+n = 

(V/J = %(—€). 

- (vii) %=%• 

MU) =S M . 

Exercise 7. Prove theorem 16. 

We give now some examples. 

(a) M consists of a single point jco, 

S [XQ] a) = i(x 0 ). 

(b) M is the ball Br of radius R around 0: [\x\ < 

S Br (£) = RW. 

(c) Mis the ball B^(xq) : {|x — jcoI 5 沢 }，using examples (a) and (b)，and part (v) 
of theorem 16, we get 

Sb r (xo)^)=Ux 0 ) + RW- (33) 

Theorem 17. X is a normed linear space over R, M a bounded subset ofX.A point 
z ofX belongs to the closed, convex hull M ofM iff for all £ in X\ 


( 34 ) 
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Proof. By definition (32) of support function, for all i in X f and any in 
l{z) < By parts (vii) and (viii) of theorem 16, so that {M) is 

satisfied for all z in M. 

Conversely, suppose that z does not belong to M. Since M is closed，some open 
ball Br{z) centered at z does not intersect M. By the extendedhyperplane scporarion 
theorem, theorem 6 of chapter 3, there is a nonzero linear functional l ; o and a real 
number c such that 

^ < ^0(v) 

for all u in M, all v in B R (z). It follows from the right half of inequality (35) that 
is a bounded linear functional. ， 

The points v of Br{z) are of the form i; = z 4- Rx, |.t| < 1. By the right half of 
inequality (35 )， 

c < £q(z) + R^o(x). 

It follows from the definition of the norm of a linear functional that 


inf 名 o(x)== 一咖 ; 

W<t 

it follows from the inequality above that 

C<g 0 ( ： )-^l«ol- ( 36) 

From the left half of inequality (35) and the definition (32) of Sm ，we coiK'lutlc that 

s M m < c. ( 36 ') 

Combining (36) and (36’）gives 

%( 《 0) + 尺 Kol 5 b ⑵. （ 37) 

Since-€o # 0, |«ol > 0; thus-(37) shows that if s does not belong to (34) fails for 

some ?o, as asserted in theorem 17. ^ 

Theorem 18. K denotes a closed, convex subset of a real linear space X, :. ci point 
of X not in KrTHeh •—.--- 

inf \z - u\ = sup [£(:) — 5/c(£)]. ( 38 ) 

u in K jf|=l 


Proof. By definition (32) of support function ， 


So for \i\ = 1, 


SkW > «(w) for all 6, all u e K. 





We use npw additivity anc 


-她). 


( 40 ) 


. SK^B R ao) = s K (e Q ) + R\e 0 i 

Since we may choose to have norm = 1, it follows from (40) and (40’）that 
R < 物 - S K (i 0 ) 

for some £o with \£q\ = 1; it follows from this that the sup on the right of (38) is 
> /?• Since R can be any number less than the inf on the left of (38), it follows that 

|w - z| < sup [€fe)-%(€)]. (39 7 ) 

Combined with (39) this proves (38). □ 

Theorem 18 presents another example of dual variational problems. Theorem 7 
is a special case of theorem 18, if we extend the definition of support function to a 
linear space F: 

SyW = 


0 if£ in Y 1 
00 if € not in y 丄 . 
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APPLICATIONS OF DUALITY 


9.1 COMPLETENESS OF WEIGHTED POWERS 

Let w(t) be a given positive function defined on E that decays 
|r| oo ： 

0 < w(t) < ae~~ c ^\ c > 0. 


exponential 1 


Denote by C the set of continuous functions on E that vanish at oo: 

lim x(t) =： 0. 

⑷— Od 

C is a Banach space under the maximum norm. 

Theorem 1. The functions t n w(t) belong to C; their closed linear span is all 
That is, every function in C can be approximated uniformly on K /?y weighted t 
nomials. 

Proof. We will use theorem 8 of chapter 8. Let t be any bounded linear funct 
over C that vanishes on the functions t n w: 

i(t n w) = 0, = 0, 1.... 

Let f be a complex variable, |Im f | < c. Then w(t)e 1 ^ belongs to C, and so 



is defined in the strip 11 
complex difference quoti 





9 - 2 THE MUNTZ APPROXIMATION THEOREM 

According to the Weierstrass approximation theorem, any co 
on the interval [0, 1] can be approximated uniformly by polj 
any integer. Clearly, if x(/) is continuous on TO. II is 



DW )’(幻 can be approximated arbitrarily closely in the maximum norm by polyno- 
ials p(s). Setting s = t n , we conclude that x(/) can be approximated arbitrarily 
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closely by linear combination of fJ n , 7 = 0,1, — Thus in the Weierstrass approxi¬ 
mation theorem not all powers of t are needed. 

Serge Bernstein posed the following question: What sequences of positive num¬ 
bers {Xj} tending to 00 have the property that the closed linear span of the functions 

1 ， {^}, j = l,2,... (6) 

is the space C of all continuous functions [0,1]? After some preliminary results were 
obtained by Bernstein, Mlintz proved the following: 

Theorem 2. Let kj be a sequence of positive numbers tending to 00 . The functions 
(6) span the space of all continuous functions C on [0,1] that vanish at t = 0 Iff 

E 六. ⑺ 

Proof. We will use the spanning criterion, theorem 8 , chapter 8 . Let ^ be a 
bounded linear functional on C that vanishes on all the functions ( 6 ): 

e(t^) = o, 7 = 1 , 2 ,.... ( 8 ) 

Let f be a complex variable. Re f > 0. For such fbelongs to C and depends 
analytically on f, in the sense that 

— = Qogt)P 

exists in the sense of the norm in C, the maximum norm. Define 

/(?)= 以 〆) • ⑼ 

It follows that / is an analytic function of Furthermore, since |^| < 1 when 
0 < ? < 1 and Re f > 0, and since £ is bounded, say \i\ < 1, it follows from (9) that 

i/(OI < 1 ' forRef > 0 . ( 10 ) 

Relation ( 8 ) can be expressed as 

/(^y) = 0 ._ _ .(ID 


( 12 ) 


(13a) 


We define a Blaschke product 5"(() as follows: 

" izh. 


5"(o=n 


It has the following properties: 


lim ■ 
8-^0 


. 


== 0 , 








criterion, we conclude that all functions t k can be approximated uniformly by linear 
combination of the functions {^'}. Taking in particular/: = 1 ， 2,3, … and appealing 
to the Weierstrass approximation theorem, we conclude that the functions (6) span C. 
We omit the proof of the necessity of condition (7). □ 


NOTE. Szasz has extended Muntz's theorem to comp 
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9.3 RUNGE，S THEOREM 

Theorem 3. Let D be a bounded simply connected domain in C. Every analytic 
function f (f) in D can be approximated, uniformly on compact subsets K } by poly¬ 
nomials in f. 

Proof. Since D is simply connected，every compact subset of D is contained in a 
simply connected compact subset K of D. Choose a closed smooth curve in D — K 
that winds once around every point of 尺， and express /(f), f in K, by Cauchy’s 
integral formula. This integral can be approximated, uniformly for all points f of K, 
by a sum. This sum is a linear combination of functions of the form (x — f> X 
on the curve. Therefore to prove the theorem, it suffices to show that all functions of 
f of form (x - f)*" 1 , x not in K, can be approximated on K by polynomials in f . 
This is clear when |x l > R, R = max|f|, < in K. Then the geometric series 

oo w/1 

converges uniformly on [ To show it for all f, we will use the spanning criterion. 
Let i be any bounded linear functional on C(K) that vanishes on all polynomials: 

i{p) = 0. 

We claim that € vanishes on all functions of the form (X — ?)— 1 ， X not in 尺 . Define 

Since (x - f) 一 、 as element of C(K), depends analytically on x，it follows that 
g(x) is an analytic function of x in the exterior of K. Since for lx I > 及 ， （X 一 C)— 1 
belongs to the closure of polynomials p, and since t{p) = 0, it follows by continuity 
that f) 一 1) = 0 for such x ? and so 

尽 (X) = 0 , for|/?| < |x|. 

Since g is analytic in the exterior, and since the exterior of a simply connected set 
K is connected, it follows that g{x) = 0 for all x not in K. Then, by the spanning 
criterion, theorem 8 of chapter 8, for all x outside K, (x — f)"" 1 is in the closure of 
― yae-spaGe-of-pol-ynornials^ -……- - -- - -- Q_ 

This beautiful proof is due to Lars Hormander. 


9.4 DUAL VARIATIONAL PROBLEMS IN FUNCTION THEORY 

Theorem 4. Let D be a bounded domain in C，whose boundary consists of a finite 
number of C x arcs. Denote by A the space of functions analytic in D and continuous 








m = max \£(u 0 )\, 


We denote by i Q an i that maximizes (21). We 
9Z3by 


Mx) = ^0 
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We claim that /o has the following properties: 


(i) /o(x) is analytic for x noton dD. 

(ii) /o(x) = 0 for x not in D. 

(iii) I/0(X)I < 1. 

(iv) /q(?o) = m. 


Property (i) follows from the fact that l/(f - x) as an element of C depends analyt¬ 
ically on x- Property (ii) follows since for x not in D, l/(f - x) lies in A, and i 0 
vanishes on A. To prove (iii), choose x near dD, and take x / to be the reflection of 
X across D. x' lies outside D. We will use (ii) to write 


MX)= 


fo(x) — foix') = r—r^o 




Since |4I < 1- 


l/o(x)l < — I— --- I =— f — lx ~ — I 奸丨 

A simple estimate shows that the integral on the right is < 1 + e for x near the 
boundary. So|/o(x)| < 1+^forx near the boundary; from this (iii) follows by the 
maximum principle. 

Differentiate (22) and set f = fo; using (18) and (21), we get 


/o(fo) = ^o(«o) = rn, (23) 

Since we have already shown that |/ / (fo)l <wfor|/| < 1 in £>, it follows that / 0 
solves the maximum problem (17), and that M = m. 口 


Suppose that D is simply connected. We claim that /o, the function maximizing 
(17), maps D conformally onto the unit disc. Here is a sketch of a proof: 

」t can be shown that the minimum probl^n (19) has a solution, call it ^ 0 * Setting 
/ into (20)7aiid using (23^ we get 


f dD Muo-go) d ^ds, 


〜 where^.is:arclengthv ， Using:the-faGt that |-/o(?)|- <-1, and that, by (19), 


we conclude that l/p(j：)| = londD and that _ 

/o(«o- 肋) f > 0 . on 3D. 


(24) 











Denote by 2 tt[/z] the change of argument of a nonzero complex valued function h 
—around* 9 Z). From (24) we deduce that 

[/0] + [«0 10] + 图 =0. (25) 

.■According.to ,the..argument principle, for the boundary values 力 . of functions mero- 
morphic in D, __ 


[h] = # zeros - # poles in D. 

But /o and gQ_ha.vt no poles, and uq has a single pole of order 2. For D simply 
connected, [d^/ds] = 1; so we deduce from (25) that 

# zeros of /o + # zeros of (wo - go) — 2 + 1 = 0. (26) 

It follows from (26) that /o has at most one zero in D. We claim that it has at 
least one zero; otherwise, / 0 一 1 would be analytic in D. Since |/o| = 1 on 3Z), it 
would follow from the maximum principle that |/o(?)| = 1 in D, which implies 
that /o e const., contrary to / 父如） =/n. Combining the two statements above, we 
conclude that /o has exactly one zero in D. 

According to the argument principle, [/q] equals 1. Since |/o(f)| = 1 for f on 
3D, [/q — w] = l for all w inside the unit disc; it follows that /(f) takes on every 
value w exactly once in D. This shows that /o maps D one-to-one onto the open unit 
disc. 

The arguments used in this section combine methods introduced by Rogosinski 
and Shapiro with results of Garabedian and Schiffer. 


9.5 EXISTENCE OF GREEN’S FUNCTION 

Definition. Let D be a plane domain whose boundary B is once continuously dif¬ 
ferentiable. Green’s function G(p, q) of the domain D is defined for p,q in D by 
the requirements that 

(i) A P G = S(p — q), where A p is the Laplace operator with respect to p, and S 
is the Dirac distribution, see Appendix B. 

(ii) G(p ， q) = O for p on B. 

In this definition, the variable p and q play unsymmetric roles, cj appearing merely 
as a parameter in a boundary value problem. 

The significance of Green’s function is that it can be used to represent every har¬ 
monic function It in D in terms of the boundary values of h by using Green’s formula: 

h{q) = f h(p) G n (p, q) dp, 

JdD 
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where G n (p, q) is the derivative of G with respect to p in the direction normal to 
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APPLICATIONS OF DUALITY 


(i) k(w) is a differentiable function ofw, and satisfies 

.A w k = 0 (33) 

in each component of,the complement of B. 

(ii) For w in the exterior of D, k{w) belongs to H. 



giw, q) =i q (k(w)). (34) 

where A: is defined by (32). 


Lemma 5. 

(i) g(w, q) is a harmonic function of w in each component of the complement 
ofB. 

(ii) For w f in the exterior of D ， 

g{w',q)^\oi\q -w'\. ' (35) 

Proof Since l q is linear, by (34), 

„ f k(w + du) ~ k(u>)\ g(w + du,q) - g(w, q) 

H i )= d _ 

We let d tend to 0. Since t q is bounded, we deduce that 

iq(d w k) = d w g(w, q). 

Applying this to second derivatives and using (33), we get 

A w g = l q (A w k) = 0, 

as asserted in part (i). . 

For w f in the exterior of D, k(w f ) belongs to H. Applying the original defintion 
(30) of ~ in (34) yields (35). □ 


Lemma 6. g(w, q) depends continuously on w as w crosses the boundaiy. 






KX1STENCE OF GREEN’S FUNCTION 


By definition (34) and since i q is linear, 

giw, q) - g(w', q) = i q (k(w) - k^w 1 )) = t q 


log 


\P~w\ 
\P - w'\ 



It is easy to verify that since the boundary B has a continuously turning tangent 


\P~w\ { 

\P - w'\ 

as w tends to the boundary B, uniformly for all points p in B. It follows that 


log 


\p~w\ 
\P - w'\ 


-> 0 


(3 ； 












Remark 2. In case of a boundary curve B that is twice differentiable, relation (37) 
can be sharpened to 



where d = |iy — /?| is the distance of w to the boundary. Using this, we can sharpen 
(38): - ; ^ 

We set this into (36). Since by (35), g(w\ q) == log|^-w;|,it differs from g(p ， q)= 
\og\q — p\ by 0(d). So we conclude that as it; in D tends to the nearest boundary 
point p,. 

\g(w y q)-g(p,q)\<0(d). (41) 

We claim that the first derivatives of g are uniformly bounded in D up to the bound¬ 
ary. This is because we can express the first derivatives of the harmonic function g at 
w as integrals over the circle of radius d centered at w: 

In grad g (w) = ^J [g(iu + de(Q)) - g{p)]e(fi)dB, (42) 

where e{Q) = (cos0 ， sin@). It follows from (41) that the integrand in (42) is 0(d)', 
from this the uniform boundedness of grad 容 follows. 

We know from the Cauchy-Riemann equations that the conjugate harmonic func¬ 
tion to g also has uniformly bounded first derivatives in D. This shows that for D 
simply connected, the analytic function mapping jD onto a disk is uniformly Lips- 
chitz continuous. 

Green’s functions of more than two variables were constructed in Lax. 
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WEAK CONVERGENCE 


Definition. A sequence {.r /z } in a normed linear space X is said to converge weakly 
to x if 

lim t{x n ) = i{x) (!) 

n-^oo 1 

for every l in X f . This relation is indicated by a half arrow: 

Xn-^X. (| ; ) 

Another notation is 

W - n l ^ co X n= X - (I") 

The notion is to be contrasted with converge in sense of the norm: 

- >| = 0 . ( 2 ) 

In this case we say that [y n ] tends strongly to y, and denote it.as 

J/i — ^ y- (2’） 

Another suggestive notation for strong convergence is 

S ~^oo yn=y - ( 2 〃） 

— Clearly, a sequence that converges to x strongly also converges weakly to x. hut 
in general not vice versa. Here are some examples: _ _ _ 

Example L X = £ 2 '; its points are vectors -- —~ . - 

^ = (3) 

with denumerably many components, such that 

IWI 2 = X1K/| 2 < oo. ( 3 f ) 


4 


?• 
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Since is a Hilbert space, according to theorem 4 of chapter 6 all bounded linear 
functionals on € 2 are of the form 

€(x) = (x, y) = J2 a J b J' l fe yl 2 < (4) 

-Define X/r-as-the /zth-unit vector, that is, the vector whose nth component is 1, all 
.others zero : ^ = (0, •. • ，0,1， Q,...). It is easy to show, and left as an exercise, that 
the sequence fa} tends weakly torero, but not stt*ongiy. 

Example !, ii/ any Hilbert space, {j: / z } an orthonormal sequence; such a sequence 
tends weakly, but not strongly, to zero. 


Proof. It follows from Bessel’s inequality (31) of chapter 6, that for any y in H 
X^KJCn.y)! 2 < llyll 2 . (5 


from which it follows that 


i{x n ) = {x JU y) 0. 


(50 


Since according to the Riesz representation theorem, theorem 4 in chapter 6, all linear 
functionals are of the form (50, weak convergence to zero follows. Since \\x n \\ = 1 
for all «， {x n } does not tend to zero strongly. □ 


Example 3. X = C[0,1]. 
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which implies that |^| < 4 for all K. From (6) it follows that 
K 

= > KS - 
1 

Since this holds for all K, and since \yK\ < 4 for all K, the boundedness of i is 
contradicted. Since |^| = maxx n (0 = 1, {.r n } does not tend to zero strongly. □ 

Exercise L Prove inequality (7). (Draw a picture of the graph of^(O). 

Example 4. X = l { , consisting of vectors x = 1^1 = Yl\ a k\ < oo. As 

we saw in chapter 8 (exercise after theorem 11), the dual of is t°°. The following 
observation is due to Schur: 

If a sequence {^ ；1 ) in i x converges weakly, it converges strongly. 

Exercise 2. Prove the preceding statement. j.T.U MERKcZ NEbi 

Demirb3-$ Ho ...Ti.hi.^3 / . 

10.1 UNIFORM BOUNDEDNESS OF WEAKL^ Qt Gir ' 5 . . 

CONVERGENTSEQUENCES 

The following result is useful in proving weak convergence: 

Theorem 1. Suppose that a sequence {jc, z } of points in a nonned linear spacejatis^ 
fies 

(i) { |知 I} are uniformly bounded: 

M <c. 

(ii) limi(x n ) = i{x) for a set of i dense in Then 

w — limjc,, = x. 

Exercise 3. Prove theorem 1. 

… Surprisingly, the -converse-oftheoreTn- Hre r ldy i irBaiiadi spacgsrTo see~thfe, we - 

appeal to the principle of uniform boundedness for a- complete^ metric space" •Srifrr"™ - 
collection [f v ] of continuous real-val.ued..functipns../ u on .5 is._b.punded„at. each point 
x of 5, 

\Mx)\ < M(x) for all y, 
then the functions f v are uniformly bounded, 

\Mu)\ < M, 


( 8 ) 

(80 
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WEAK CONVERGENCE 


for all u in some nonempty open set O. We specialize this to the case where S is a 
Banach space X and each f v is subadditive and absolutely homogeneous: 

/(^ + )0 < fM + /(•、，)， f(ax) = \a\f(x). (9) 

Theorem Let X be a Banach space, [f v ] a collection of real-valued continuous 
subadditive and absolutely homogeneous functions on X, bounded at-each^point 

X .as in.(8). Then "ze. {/ v }. 瓜艺么 - " 7 如祕 - 

that 

\Mx)\<c\x\ (10) 

for all f v and all x in X, * 

Proof. By the principle of uniform boundedness for metric spaces, |/ y («)l < ^ 
for all f v and all u in some open ball u = z + y, |^| < r. Using subadditivity, we 
have that 

1/uOOI = \Mu^z)\< \Mu)\ + \Mz)\ < 2M (11) 

holds for all y with \y\ = r/2. For any x in X define y by y = }^x/2\x\. By con¬ 
struction, |v| = r/2, so (11) holds. Using absolute homogeneity we get from (11) 
that 

I 叫 A (宇 . V ) 卜宇彻 ^ 

this proves (10), with c = 4M/r. □ 

An immediate consequence of theorem 2 is 

Theorem 3. X is a Banach space, {£,,} a collection of bounded linear functionals 
such that at every point x ofX 

\tv{x)\ < M(x) for alU v . (12) 

Then there is a constant c such that 

\tv\<c foralU,,. (13) 

Proof. |£(x)| is a continuous subadditive and absolutely homogeneous function of 
x. Therefore theorem 2 is applicable; its conclusion (10) yields (13). □ 

Another immediate consequence is 


Theorem 4. X is a nonned linear space, a collection of points in X such that 
for every bounded linear functional t 
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\i(x v )\ < M{i) for all Xv (12') 

Then there is a constant c such that 

M < c for all x v . (13’) 

Proof. Theorem 4 follows from theorem 3 applied to the Banach space X\ on 
which the elements x v of X act as bounded linear functionals. 口 

An immediate consequence of theorem 4 is 

Theorem 4’. A weakly convergent sequence {.r,,} in a normecl linear space X is 
uniformly bounded in norm. 

Proof. Weak convergence means that t(x n ) is convergent for every i in Since 
a convergent sequence of numbers is bounded, hypothesis (12’）of theorem 4 is sat¬ 
isfied; therefore (13’）holds. □ 

Theorems 2, 3, and 4 are called the principle of uniform boundedness. 

Theorem 5. Let be a sequence iri a nonned linear space converging weakly 
to x. Then 

|x| < iim inf |^|. (14) 

Proof. According to theorem 6 of chapter 8, there is an l in X\ such that 
1 和剛 ， m = 1. 

Since weak convergence means that 

and since 

\Ux n )\ < \i\ |^| = |x«|, 

(14) follow^___g 

The following far reaching generalization of theorem 5 is duelo Mazur ^ 

Theorem 6. Let K be a closed, convex subset of a nonned linear space X, {.r/,} a 
sequence of points in K, converging weakly to a point x. Then x belongs to K. 

Proof. Let Sk be the support function of K, defined by equation (32) of chapter 8 
as sud. ；„ t(x). It follows from that definition that for anv P. in X f 





ccording to. theorem 17 of chapter 8, this guarantees that a: belongs to K. □ 

'ise 4 t -Deduce theorem 5 from theorem 6 applied to balls-Gentered -at-the-origin- 
Br : [x\ \x\ < R}. 


WEAK SEQUENTIAL COMPACTNESS 

ition. A subset C of a Banach space X is called weakly sequentially compact 
sequence of points in C has a subsequence weakly convergent to a point of C. 

、ise 5, Show that a weakly sequentially compact set is bounded. 

e importance of weak sequential compactness is the same as that of compact- 

in the sense of strong convergence. Weak compactness is a valuable tool in 

ructing, as weak limits, mathematical objects of interest. To wield this tool, we 
simple, easily verifiable criteria for weak compactness; the following is such a 
ion: 


3anach space X the closed unit ball is weakly sequentially 
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and so (16) says that for all m in Y\ m(z n ) tends to m(y) as n oo. Since the 
restriction of any t in X f to Y is an m in Y\ this proves that z n converges weakly to 
a point y in the unit ball. 口 

Note the sharp contrast between theorem 7 and theorem 6 of chapter 5, according 
to which the unit ball is never compact in the norm topology. Compactness is gained 
by replacing strong with weak convergence. 

Eberlein has proved the converse of theorem 7: 

Theorem 8, The closed unit ball in a Banach space X is weakly sequentially com¬ 
pact only if X is reflexive. 

Combining theorems 6 and 7 gives the following useful result: 

Theorem 9. In a reflexive Banach space every bounded, closed, convex set is weakly 
sequentially compact 

Here is a useful application of theorem 9. 

Theorem 10. Let X be a reflexive Banach space, K a closed、convex subset ofX, z 
any point ofX. Then there is a point y ofK which is as close to z as any other point 
ofK. 

Proof. We may take 2 = 0, and assume that 0 ^ K. Denote by s the distance of 0 
to K, that is, 

j = infM， yin K. (17) 

Let {y^} be a minimizing sequence for (17). We may assume that each y n lies in 
the intersection of K and that the ball of radius is 2^ around the origin. This is a 
bounded, closed, convex set, therefore, by theorem 9, a subsequence {^} of {y n } 
converges weakly to some point z of K. According to theorem 5, 

\z\ < hm inf|z /z |. (18) 

Since {z,,} is the subsequence of a minimizing sequence, lim| 2 rt | = s. Combining 
this with (17) and (18) gives \z\ = s; that is, z is _a point of K closest to 0. □ 

Tlieorem i0 is a generalization of theorem 8 of chapter 5. There we assumed that 
X\s uniformly convex; here we assume only that X is reflexivS：^" 


10.3 WEAK* CONVERGENCE 


In a Banach space U that is the dual X' of another Banach space X 9 there is a subclass' 
of linear functionals associated 











A weak til convergent sequence {m,!} of points in a Banach space V = 
ly bounded. 

* convergence implies that the boundedness condition (12) holds at 
f X. So theorem 3 implies (13)，uniform boundedness. □ 






The following important result is due to Helly: 


rheorem 12. Let X be a separable Banach spac 
U is weak* sequentially compact. 

Proof, Given a sequence {u n ) in U, 

1^1 < 1 , 

ind a denumerable set {.r^} in X, we can, by i 


quence {v«l of {«,] such that 


lim v n (xk) 

n-^oo 

exists for all Xk- It follows from (23) and (24) that 
lie in the closure of the set {^}. So, if we take foi 

to a limit for all x in X. It is easy to see that this 

using (23), that it is bounded by 1. 
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APPLiCAT-IONS OF 
WEA&eGNVERGENGE ~ 


11.1 APPROXIMATION OF THE 8 FUNCTION 
BY CONTINUOUS FUNCTIONS 


Definition. A sequence [k n ] of continuous functions on [—1 ， 1] tends to the 5 func¬ 
tion if 

一上 % jf_: / ⑴ k n {t)dt = /(0) (1) 

for all continuous functions f on [—1 ,]]. 

Theorem 1 (Toeplitz). The sequence [k n ] of continuous functions on [一 1 ， 1] tends 
to the 8 function in the sense of(\) if and only if it satisfies the following conditions: 

(i) 

lim = (2) 

(ii) For eveiy C°° function g whose support does not contain 0, 

lim f l g(l)k n (t)dt^0. ⑶ 

oo t /一 ！ 

(Hi) There is a constant cfor which 

f\n(t)\dt<c ⑷ 

holds for all n. 
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DIVERGENCE OF FOURIER SERIES 


Proof. Suppose that /(0) = 0; let g be a C 00 function that differs from / by 1( 
than e at all points t in [-1, 1], and that is zero in some interval around f = 0. T1 
by (4 )， 

( (/ <e J \k n \dt < ce. 

By assumption (3), J gk n dt tends to zero, so it follows from (5) that 
limsup|y fk n dt\^ < e. 

Since e is arbitrary, (1) is verified in case /(0) = 0. Every function can be deco 
posed asb + f,b some constant and /(0) = 0, so (1) follows from (2) for evi 
continuous /. 

Now to the converse: condition (2) is clearly necessary, for it is a special case 
(1) when f(t) = 1; the same goes for (3). 

We can regard [k n ] as a sequence in C’[— 1 ， 1], and state (1) as 

w* ― lim^j = 5. 

According to theorem 3 of chapter 10, the norms 

\K\ = J \k n {t)\dt 

must be uniformly bounded. This proves the necessity of (4)，and even more: 

Corollary 1’. If (4) is violated, there exists a continuous function f for which i 
left side of(l) tends to infinity. 


11.2 DIVERGENCE OF FOURIER SERIES 


Theorem 2. There exis 
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APPLICATIONS OF WEAK CONVERGENCE 


The convergence of the series at, say, 0 = 0 means that 


Using (7'), we can write 

N 

/(0) = Jim YL a >'- 

(8) 


^ 厂丌 


where 

—n J -兀 

⑼ 

- : ~~~ 1 —— -—— …-——-—— ' 

N 



2nk N m = jy- in() . 

一 N 

(10) 

Using the formula for the 

sum of a finite geometric series, we readily get, for 0^0, 


n , ^ sin(A/ +1/2)0 
lnk _ = sine/2 - 

(100 


Thus the convergence of the Fourier series of every continuous function is equiv¬ 
alent to the sequence {/:„} defined by (10) approximating the 5 function. According 
to theorem 1 this is the case iff con3itions (2), (3), and (4) are satisfied. We show 
now that condition (4) fails! To see this, we use the inequality |sin</>| < |^|, which 
implies that |l/(sin0/2)| > 2/|0|. Using (10’) and a little calculus, we get 

This last integral is, it is easy to show, > const, log 从 Thus condition (4) fails, and 
so, by Corollary 1’， for some function /， the Fourier series of/ diverges at 0 = 0 to 
infinity. □ 

Exercise L Show that there exists a continuous periodic function whose Fourier se¬ 
ries diverges at n aribtrarily given points. 

11.3 APPROXIMATE QUADRATURE 

An approximate quadrature formula is an approximation to the integral of a contin¬ 
uous function / on, say, [ 一 1 ， 1]. Take N points ij in | — 1 ， 11，called nodes, and N 
numbers Wj called weights .，define q{f) by 

N 


9 (/) = 


(id 






)STRONG ANALYTICITY OF VECTOR-VALUED FUNCTIONS 


q as an approximation to 


J 

3. Let be a sequence of quadrature formulas of form 
Ing conditions: 


every nonnegative integer k, 


^irn^qN^) 


all N t 


constant. Then 


N 


Hm q N (f)= / 

/-»00 

for all continuous /. Conversely, if (14) holds for all continuous /, (12) 

(13) must be satisfied. 

Proof. It follows from (12) that (14) holds for all polynomials /. Inequality (13) 
asserts that the linear functionals qN on C[— 1 ， 1] have uniformly bounded norms. 
Since the polynomials are dense in C[—1, 1], (14) follows for all continuous /. The 
converse follows from theorem 3 of chapter 10 • 口 

Exercise 2. Prove that if the weights Wj are positive, (13) follows from (12). 


11.4 WEAK AND STRONG ANALYTICITY OF 
VECTOR-VALUED FUNCTIONS 


Let / be a function defined in some domain G of the complex f plane, whose values 
lie in some complex "Banach space^T 一 "■― 一 ’ 二 ㈣ = 
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APPLICATIONS OF WEAK CONVERGENCE 


Definition. f(^) is weakly analytic in G if for every bounded linear functional ^, 
£(/(f)) is an analytic function of C in the classical sense. __ 

N. Dunford has proved the following surprising result: 


Theorem 4. A weakly analytic function is strongly analytic. 






definition (17) of xi u k this implies the norm inequality 




n ... 埋 - /(H <c|/:-A1. (18) 

Since X is complete, it follows that the difference quotients of /(f) tend to a limit 
in the strong sense. 口 


11.5 EXISTENCE OF SOLUTIONS OF PARTIAL 
DIFFERENTIAL EQUATIONS 

We denote by La first-order partial differential operator of the following form, acting 
on vector-valued functions: 

m 


L — A j 3j + B. 


(19) 










EXISTENCE 


F PARTIAL DIFFERENTIAL EQUATIONS 


Here Aj and B are square matrix valued functions of the independent variables s 
Aj(s) being once differentiable, B(s) continuous, and 




We assume, for simplicity, that Aj and B, as well as the functions on which L 
are periodic in all variables s and that they are real valued. We denote, as customary ， 
tht formal adjoint of L by L*: 

L* = -J23jA] + B T , (190 

where A r , B T denote transposes. Integration by parts shows that for any pair of C 1 
vector-valued periodic functions u and v, 

(v,Lw) = (L*v,iO. (20) 

Here the bracket denotes the L 2 scalar product over a period cube: 


(v t w )= 


v(*y) • w(s) ds; 


the dot denotes the dot product between vectors, and F a period cube. 

Suppose that every Ay is symmetric: Aj = Ay. Then comparing (19) and (190, 
we deduce that 

L* = -L-J2 a JJ +B + B t . 

Here Ajj denotes the partial derivative of Ay with respect to Sj. Setting this into 
(20), and choosing v = u, we get 

2(«, Lu) = ((L + L*) u, u) = ([B + B T u, m) . (20 ，） 

Using the language of distributions, see Appendix B, we state 
Theorem 5. Suppose the matrix on the right in (20') is positive definite: 

B + B T Ajj > kl, k>0. (21) 

Then for every periodic, square integrable f the equation 

—— -Ly • 三 -f (22)- 

has a solution y in the sense of distributions that is periodic and square integrable. 


from (20’) 






where j|w|| denotes the L 2 (F)-noirn7Denote the Hilbert space L.-\r) oy /i. * 

• 一 be-any-finke^dimensionaLsubspace of // consisting of periodic C 1 functions; denote 
the orthogonal complement of 7 in // by y 丄 . Consider the equation 


Ly - / e J 丄 ⑽) 

for, y. In Y ： equations for y m Y, N = dimy. According to 

linear aigebra, such a system of linear equations has a solution for every / iff the 
homogeneous equation 

Lz e y x , z in Y, (23) 


ls~satisfied only by z = 0. Take the scalar product of (23) with using (210, we get 

0 =(仏)匕备脚 2 ， 

which implies that z = 0. So it follows that (22 a 0 has a unique solution y. Take the 
scalar product of (22") with y; using (21’）and the Schwarz inequality, we get 


\ lb-ll 2 <(v^v) = 0-,/)< 11)11 ll/ll- 

This implies that 

llyll < \ ll/ll- (24) 


Now let F/v be an increasing sequence of subspaces of C 1 functions whose union 
is dense in H. Denote by )，〜the solution of (22//). It follows from (24) that ||vyv|| 
is a uniformly bounded sequence. Therefore，since H is reflexive, we appeal to the¬ 
orem 7 of chapter 10 to conclude that a subsequence of {3^}, also denoted as {) ， yv }， 
converges weakly: 


w — lim yj^ = y. 


Let v belong to U since each IV consists of differentiable functions, v is 
differentiable, for it belongs to some Ym ，For y^/ in Yn, 

LyN 一 / e 味 


Take the scalar product of 
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Since the sequence converges weakly to y, we conclude that for every i; in U Y t \ / 

(L%，)0 —(i; ， /)=0. (25 ； 

We can choose the spaces 7" so that their union is dense not only in H but also ir 
H\, the space of all periodic L 2 functions whose first derivatives belong to L 2 . Tha' 
means that given any periodic C 丨 function v, there is a sequence [vk] of functionj 
in U Yfsj such that Vk converges in the L 2 -norm to v, and the first derivatives of Vk 
converge to the first derivatives of v in the L 2 -norm. Since L* is a first-order operator 
it follows that L* Vk tends to L* v in the L 2 -norm. Setting v == in (25)，we car 
pass to the limit and conclude that (25) holds for all i; in C 1 . 

A function y that satisfies (25) for all C 1 functions v is said to satisfy the differ¬ 
ential equation (22) in the weak sense. Clearly, such a u is a solution of (22) in the 
sense of distributions, which requires (25) to hold for all Cq° functions v. 匸 

Friedrichs has shown that a weak solution y of (22) is a strong solution in th( 
following sense: there is a sequence of C 1 functions Zn that converge to y in the U 
sense, and at the same time Lz n converges to / in the L 2 sense. It is easy to show 
using (21 7 ), that equation (22) has only one strong solution. It follows that not only « 
subsequence, but the whole sequence converges. 

The method described in this section to obtain the solution y of equation (22) a: 
the weak limit of the solutions jyv of equation (22^) is called GaleHdn’s method. I 
is more than a theoretical device for proving the existence of a solution of (22); it L 
also a practical method for constructing it. 


THEB 

WITH 


POSITIVE REAL PART 


Let./(f) be an analytic function in the unit disk If | < 1 whose real part is positive: 
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We associate with each R n a linear functional 

e„(u) = j\(^R n e w ymde (27) 

acting on the space C of continuous functions u on the circle *S 1 VITF611ows'ff^ 

■:以：• 二" ■ 二 ■■ 二口 -:' 二， 

~~ 141 = _• 

Since C(5 J ) is separable, we can appeal to Helly's theorem, theorem 12 of chap- 
„te£lD^_andx.onclude_thaL:a_s.ubsequence .of [in) is weak* convergent to some limit £: 

lim i n (u) = l(u) (28) 

n-^-oo 、 

for all continuous functions u. It follows from (28) and the uniform boundedness of 
141 that for any sequence u n strongly convergent to u 

lim i n {u n ) = ^(«). (28 ; ) 

/?-^oo 

We apply this now to 

- R n + ? e~ w 1 + f e~ w 

Un = Rn-Ke~ i6 ' U= 1-f 0 ’ 
f any complex number with |f | < 1; using (26), (27) and (280, we get 



weak* limit L According to corollary 14’ of the Riesz representation theorem, chap¬ 
ters, such a nonnegative functional acting on C(5 ! ) can be represented as an integral 
with respectto a positive measure m. Thus we have proved the first part of ' 

Theorem 6 (Herglotz-Riesz). Every analytic function f in the unit disk |f| < 1 
whose real part is positive there can be expressed as 


f 1 + t e~ id 

r^ dm+ic . 


m a positive measure, c real (29) 


Conversely every function f so represented is analytic in the unit disk and has posi¬ 
tive real part there. The representation (29) is unique. 
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representation is unique, we note that the real part of (29) is 
l-r 2 




•2r cos ( 中 —0) + r 2 


re l t 


(30) 


Take any continuous function multiply (30) by w(0), and integrate with respect 
to 0 over 5 1 . We get, after interchanging the order of integration on the right, 


where 


j h (r u ( 中 ) d(/> = j u r (0)dm 

Ur(9)= h 


( 31 ) 


疒 2 


2r cos(0 - 0) + r 2 


w(0) d(p. 


Suppose that h(X) can be represented in the form (30) by two different measures m 
and m f . Let r 1 in (31); by theorem 1 of this chapter, u r u in the maximum 
norm. Since the left side of (31) does not depend on the representing measure, it 
follows that 


J u(9)dm = J u{0)dm 


for all continuous functions u. We appeal now to the uniqueness of measure in the 
Riesz representation theorem to conclude that m = m!. This completes the proof of 
theorem 6. □ 


From the uniqueness of the measure m it follows that the limit (28) exists not only 
for a subsequence but every sequence of R. 
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THE WEAK AND 
—WEAK*—TO—POmGTES" 


Definition. The weak topology in a Banach space is the weakest topology in which 
all bounded linear functionals are continuous. Since bounded linear functionals are 
continuous in the norm (strong) topology it follows that the weak topology is coarser 
than the strong topology. We show now that — except in finite-dimensional spaces 一 
the weak topology is genuinely coarser than the strong topology: 

The open sets in-the-weak_topol ogy are unions of finite intersections of sets of the 
form 


[x:a< l(x) < b}. (1) 

Clearly, in an infinite-dimensional space the intersection of a finite number of sets of 
form (1) is unbounded.. This shows that eveiy set that is open in the weak topology is 
unbounded. In particular, the balls 

, U ： kl <^}, (2) 

open in the strong topology, are not open in the weak topology. 

Next, we show that the weak topology is coarser than weak sequential conver¬ 
gence, in the following sense: define the weak sequential closure of a set S in a 
Banach space X as the weak limit of all weakly convergent sequences in S. 

Theorem 1. 

(i) The weak sequential closure of any set S belongs to the closure of S in the 
weak topology. 

(ii) In eveiy infinite-dimensional Banach space there are sets weakly sequentially 
closed, but not closed in the weak topology. 

Proof. Part (i) is an immediate consequence of the definitions of weak conver¬ 
gence and weak topology. Part (ii) is exemplified by the following set S: 
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5 = 52 U 53 U .... (3) 

Each set 5^ is finite, constructed as follows: Choose any sequence of subspaces 
Xk of X, dim Xk = k. S/c consists of points Xkj, finite in number, so chosen that for 
every point x in of norm k, x in Xk, |.r| = k, there is a point in Sfc such that 

\x-x k j\ < p \x k j\=k. (4) 

We claim that the origin belongs to the closure of 5 in the weak topology. Any open 
set containing the origin contains a subset of the form 

[x : |£/(.t)| < €, / = (5) 

where the i[ are linear functionals of norm 1. Since is ^-dimensional, for k > n 
it contains a nonzero vector Xk such that 

众 ） = 0，. / =： 1， .，•，"， 

⑹ 

By construction, there is an Xfcj in Sk satisfying condition (4) for x = Using (4)， 

(6)，and |d = 1， we get for any i = ti, / = 1, that 

1 

= i(x k ,j -X k ) < \x kj -x k \ < (7) 

Therefore for k > l/€, the point x^j belongs to the subset (5). This proves that the 
origin belongs to the closure of S in the weak topology. 

On the other hand, S contains only a finite number of points in any ball of radius 
R. So, by the principle of uniform boundedness (see theorem 4 of chapter 10), S 
contains no weakly convergent sequences other than the trivial ones. □ 

Despite the coarseness of the weak topology compared to the strong topology, the 
following is true: 


Theorem 2. Every convex subset K of a Banach space X that is closed in the strong 
topology is closed in the weak topology. 

Proof. We will show that if z in X does not belong to K, then z is not in the 

weak closure of K. Since is closed in the strong topology, there is an open ball 


Br{z) centered 
theorem, theorei 
that 



























Smulyan; see Dunford and Schwartz. 

















LOCALLY CONVEX 

-repefcOGiES and the 

KREIN-MILMAN THEOREM 


The weak and weak* topologies are the weakest in which certain linear functionals 
are continuous. If one demands the continuity of even fewer functionals, one gets 
even weaker topologies. All these topologies have the property that openness can be 
defined in terms of convex sets. In this chapter we develop, and apply, the theory of 
such topologies. 

Definition. A locally convex topological (LCT) linear space is a linear space over 
the reals with a Hausdorff topology that has the following properties: 

(i) Addition is continuous; that is, (x, }0 x + : y is a continuous mapping of 
X x X into X. 

(ii) Multiplication by scalars is continuous; that is, (k, x) -> kx is a continuous 
mapping of R x X into X. 

(iii) There is a basis for the open sets at the origin consisting of convex sets; that is, 
every open set containing the origin contains a convex open set containing the 
origin. 

Note that the norm topology of a Banach space is a locally convex topology; the 
convex, open sets containing the origin that form a basis for the topology are the 
open balls centered at the origin. 

Exercise 1. Show that the weak and weak* topologies are locally convex. 

Exercise 2. Let {t a } be a collection of linear functions in a linear space X over K 
that separates points; that is, for any two distinct points x and y of X there is an 
such that t a (x) ^ £ a (y). 
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(a) Show that the weakest topology in which all the i a are continuous is locally 
convex. 

(b) Show that a linear functional i is continuous in the topology above iff it is a 
finite linear combination of the . 

Exercise 3. Show that a continuous function f(a y b) on a product of two topological 
spaces whose values lie in a topological space is a continuous function of a when b 
is fixed. 

Theorem 1. 

(1) In a LCT linear space X the collection of open sets is translation invariant; 
that is t if T is an open set，soT — x, for any x in X. 

(ii) IfT is an open set, so is kT t fork ^ 0; in particular, —T is open. 

(iii) Every point of an open set T is interior to T, 

Proof. According to exercise 3, x + y is a continuous function of y forx fixed. 
The inverse image of the open set T under this mapping is T —x; this proves part (i). 
Part (ii) follows similarly. 

(iii) By exercise 3, kx is a continuous function of k for x fixed. So the set of k for 
which kx lies in some open set T is an open subset of M. Suppose that T contains the 
origin; then k = 0 belongs to this set, and by the above observation, so does an open 
interval containing k = ： 0. That means that for: k small enough, kx belongs to T, but 
that is what it means for the origin to be an interior point of T. This proves (iii), for 
by (i) any point of T can be shifted to the origin. □ 

13.1 SEPARATION OF POINTS BY LINEAR FUNCTIONALS 

Theorem 2. The continuous linear functionals in a LCT linear space X separate 
points. That is，ify and z are distinct points ofX，there is a continuous linear func¬ 
tional i such that ' 

化 V ) 户似. (1) 

Proof. We construct a linear functional separating y and z. Without loss of gen- 
erality we take y = 0. Since the topology is Hausdortt, there is arfopefTseTT^orh' 
■t 组•加 t ndfzrbytiii)’ 石 fthe definition of LCT space, we may take T to be 
convex. By theorem 1, 0 is an interior point of 7, so the gauge function pj of T is 
finite，and 

Pt(u) < 1 for all u in T. (2) 


According to the hyperplane separation theorem, theorem 5 of chapter 3, there exists 
a linear functional t satisfying 
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..^) = 1,.(3) 

Ux) < Pt(x) for all jc. (4) 

Clearly, since i(y) = £(0) = 0, £ separates y and z. 

To complete the proof, we have to show that i is continuous. We first show that 
every halfspace w : I(w) < c is open. We claim that if w belongs to the halfspace, 

w + rT, r = c 一 i(w). (5) 

Using ⑷， （5)，and (2)，we get that for w in 7\ 

l(w + ru) = £(w) + rl{u) < i{w) + rpr(u) < £(w) + c _ l{w) = c, (6) 

showing that every point of (5) lies in the halfspace; so the halfspace is open. We can 
.show similarly that every halfspace of the form 

w : l(w) > d 

is open. For this argument pj needs to be an even function; this will be the case 
if T is symmetric around this origin. This can be accomplished by replacing T, if 
necessary, by T fl (—T). 口 

Theorem 2 can be sharpened as follows: 

Theorem 2’. Denote by K a closed, convex set in a LCT linear space X, z a point 
in X not in K • Then there is a continuous linear functional l such that 

€(}») <c for all yin K, £(z) > c. (7) 

Proof. The proof is similar to that of theorem 2, except that in place of the hyper¬ 
plane separation theorem we use the extended version, theorem 6 iifchapter 3. □ 

Exercise 4. Let K denote a convex subset of a LCT linear space X. Show that the 
closure Y of K also is convex.. 


13.2 THE KREIN-MILMAN THEOREM 

We recall from the end of chapter 1 the notion of an extreme subset of a convex set 
K and, in particular, the notion of an extreme point. A subset £ of a convex set K is 
called an extreme subset of K if: 

(i) E is convex and nonempty. 

(ii) Whenever a point of 丑 is expressed as a convex 
K, then both .v and z belong to E. 


combination 
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An extreme set consisting of a single point is called an extreme point. 

Exercise 5. Show that the nonempty intersection of extreme sets is extreme. 

The elementary properties of extreme set are contained in theorems 7 and 8 of 
chapter 1. The basic result concerning convex sets in finite dimensional spaces is the 
following theorem of Caratheodory: 

Every compact convex subset K in has extreme points, and eve)"y point of K 
can be written as a convex combination ofN + 1 extreme points. 

Exercise 6. Furnish a proof of Caratheodory’s theorem by induction on N. 

M. G. Krein and D. P. Milman have given the following beautiful 一 and useful — 
generalization of this result: 

Theorem 3. Let X be a LCT linear space, K a nonempty, compact, convex subset 
ofX. 

(i) K has at least one extreme point 

(ii) K is the closure of the convex hull of its extreme points. 

Proof. Consider the collection {Ej} of all nonempty closed extreme subsets of K. 
This collection is nonempty, for it contains K itself. Partially order this collection by 
inclusion. We claim that every totally ordered subcollection {Ej} has a lower bound. 

That lower bound is the intersection (IEj. To see this, we have to show that HEj is 

nonempty, closed, and extreme. 

We claim that every finite subset of the totally ordered collection {Ej} has a 
nonempty intersection. This is because in being totally ordered by inclusion, the 
intersection of a finite subset of the collection {^y} is the smallest member of that 
subset. To conclude that f\Ej is nonempty, we argue indirectly: suppose that the in- 
丄级從 Q 玫 Qtt j$ .of themmRlements of the cover A：. Since K 

is compact, a finite collection of these already cover K, but then the intersection of 

these finite number of Ej is empty, contrary to what we have already shown. Being 

the intersection of closed sets, DEj is closed. By exercise 5 the nonempty intersec- 



艮 frQmJZQ 明 Uj^mma„that : K_Jias_ a closed extreme subset E that is 
minimal with respect to inclusion. We claim that such an E consists of a single point. 
To see this, suppose, on the contrary, that E contains two distinct points. According 
to theorem 2, there exists a continuous linear functional l that separates these points. 
Since E is compact, and € continuous and not constant on E,t achieves its maximum 

_on some proper subse t—M_o _fJL_SingeJ_is_c_cmtiii—uous and E is closed, M is closed. 

Since the inverse image of an extreme subset is extreme (see corollary 8 / in chap¬ 

ter 1), the set M where a linear functional i assumes its maximum on a convex set 
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E is an extreme subset of E. It is easy to show further (see theorem 7 of chapter 1) 
- that4f 五 is an. extreme subset of K, and M an extreme subset of E, then M is an 

extreme subset of K. Since £ is a minimal extreme subset of AT, and M an extreme 

subset smaller than E, we have a contradiction, into which we got by assuming that 

E contains more than one point. We conclude therefore that a minimal E consists of 
a single point. This single pointTs ah"e3cfreme point of K. This completes the proof 

(iO Every closed, extreme subset of K contains an extreme point. 


We turn now to the proof of part (ii). Denote by K e the set of extreme points of 
K ，and by K e the convex 'hull' of show that every point of K belongs to the 

closure of K e is the same as showing that a point z that does not belong to the closure 
of K e does not belong to K. According to exercise 4, the closure of K e is convex. So, 
if z does not belong to the closure, then according to theorem l! there is a continuous 
linear functional l such that 

^(y) < c for all v in K e , l{z) > c. (8) 

Since K is compact and t continuous, i achieves its maximum over K on some 
closed subset E of K. According to corollary 8-, chapter 1, £ is an extreme subset 
of K. According to part (i ; ) of theorem 3 noted above, E contains some extreme point 
p of K. Since p belongs to K e , mdsdioK e ,it follows from (8) that i(p) < c. Since 
by construction i{p) = max 尺 £(x), l(x) < i(p) < c for all x in K. Since by (8), 
£(z) > c, this proves that z does not belong to K. □ 


13.3 THE STONE-WEffiRST^SS THEOREM 

Theorem 4. Let S be a compact Hausdorff space, C (5) the set of all real-valued 
continuous functions on S, I^^ ..—— 

(i) E is a linear subspace of C (S). 

(ii) The product of two functions in E belongs to E. 

In addition we impose the following conditions on E: 

(iii) E separates points of 5, that is, given any pair of points p and q、p ♦ q、there 
is a function f in E such that f(p)_ /(<?)• 

(iv) All constant functions belong to E. 

Conclusion: E is dense in C (S) in the maximum norm. 

The classical Weierstrass theorem is a special case of this proposition, with S an 
interval of the x axis, and E the set of all polynomials in x. We present Louis de 
Branges’s elegant proof, based on the Krein-Milman theorem, of Stone’s generaliza¬ 
tion of the Weierstrass theorem. 
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to h and dividing it by another large constant，we obtain a function g whose values 
-liertetweeTi O and—lTand g(p) # g(q). This contradicts our previous conclusion. 

A measure [l whose support consists of a single point p, and ||从|| = 1， is a unit 
point mass at p. Therefore 


— 一 — 一 J = f(p) orj- f(p). 

Since, by hypothesis, the constant1belong to E, j Jdfi ^ OTor a 

contradiction. □ 


T3.4 CHOQUET 5 S THEOREM 


The following further ( 
linear spaces: • 


of Caratheodory’s theorem holds on locally convex 


Theorem 5. Let X be a LCT linear space，K a nonempty compact, convex subset 
ofX, K e the set of extreme points of K. For any point u of K there is a probability 
measure m u on K et the closure of K ei tht is a measure satisfying: 


such that in the weak sense 


The weak sense of the int 
linear functional l over X t 


representation above 


i(e) dj7i lt (e). 


Wmmm, 


minimum and maximum are extreme subsets of K. According to part (i’）of theo¬ 
rem 3, these extreme subsets contain extreme points. Therefore for any u in K and 
for every continuous linear functional t, 

min £(p) < i(u) < max £(p). (11) 

p in K e p in K e 

It follows from (11) applied Jo t\ 一 ti that if and £2 are equal on K e , they are 
equal on K\ therefore i on K e uniquely determines the value of £(u) for every u 
in K. Denote the restriction of ^ to K e by /: 


f(q) = 咐)， 


q in K e . 


( 12 ) 
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Since i{u) is determined by /, we can write 

m =«(/), (i20 

clearly a linear functional of /. We rewrite (11) as 

mm f(q) < u(f) < n\3.\ f{q). (13) 

q in K e q in K e 

The set L of functions / defined in (12) forma linear subspace of the space C{K e ) 
of continuous functions on K e . We claim that we can extend the linear functional 
u(f) defined in (12’）from L to all of C(K e ) so that property (13) is preserved. 
To see this, we adjoin the function /o = 1 to L and define u(fo) = 1. Then we 
appeal to theorem 1 of chapter 4, according to which a positive linear functional can 
be extended positively to the space of all functions. Since (13) implies that u(f) is 
positive, such an extension is possible; imagine it done. 

K e is a closed subset of the compact set K and is therefore compact. We appeal 
now to the Riesz-Kakutani representation theorem (see chapter 8, theorem 14) ac¬ 
cording to which a bounded linear functional u on C(K e ) can be represented as 


«(/)= 



f dm. 


(14) 


Since the functional is positive, so is the representing measure m\ it follows from 
«(/o) = 1 thatm(/sT e ) — 1. Setting (12) and (12’）into (14), we obtain (10’). □ 


Theorem 5 asserts that every point u of the compact convex set K can be rep¬ 
resented as a continuous convex combination of points of the closure of the set of 
extreme points. This proviso is needed because the set of extreme points may not be 

closed, not even in a finite-dimensional space. Say we take in E 3 the convex hull of 

the circle: x 2 + y 2 = 1, z = 0, and the interval: 



The extreme points of the convex hull are x = 1, y = 0, z = ±1, and all points of 
the circle x 2 + 少 2 = 1， z = 0 except x = 1, _y = 0, z = 0. 


Exercise 7. Let u be a point in K e that does not belong to K e \ show that v can then 
be represented as 


e dm ， 


where m is a probability measure on K e such that m(v) = 0. 
Exercise 8 、 Deduce part (ii) of theorem 3 from theorem 5. 
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Ghoquet gave the following sharpening of theorem 5: 

Theorem 6 (Choquet). Let K be a nonempty compact, convex subset of a LCT lin¬ 
ear space, and assume in addition that K is metrizable. Then every point u of K can 
be represented in the weak sense as 

(10") 

where m u is a probability measure on the set of extreme points. 









under the mapping (17). The triple product (18) is compact, and according to the 
definition of a LCT space, the mapping (17) is continuous. It follows that the image 
of the compact set (18) is compact, as claimed in lemma 8. □ 


From lemma 8 we deduce inductively that the convex hull of the union of a finite 
number of compact sets is compact. We turn to the compact sets defined above 
and claim that the convex hull of their union, denoted as CH [尺 i U … U K n ], con¬ 
tains K: 

K cCU[K { U..,UK n ]. (19) 

We note that contains 5/. Therefore, by.(16), K\\J. ,.UK n contains S\ U...U 
S n = S. By lemma 8, the right side of (19) is compact, and therefore closed. Thus it 
is a closed, convex set that contains S. But K is defined to be the smallest such set， 
and it therefore is contained in CU[K i U... U K n ]. This proves (19). In words，every 
point of 尺 is a convex combination of points of Kj, Since each Kj is contained in 

K, it follows from the definition of extreme point that each extreme point p of K 

belongs to a K l： 

By definition. Si is contained in yi H- N. Since N is convex, the convex hull of Si 
belongs to yi N: Si C yi + N. For any set R, R + N contains the closure of R. 
Since is the closure of 5 / , it follows that K\ C yi + N + N == yi+lN. Therefore， 
since each 刃 belongs to 5, 

UKi CS + 2N, 


We have shown that each extreme point of K belongs to some K[, so it follows from 
上 e result above that every extreme point p of K belongs to S 4- 2N. Now N is 
arbitrary; since S is closed, the intersection of all sets 5 + 2iV is 5 itself. Thus every 
extreme point p of K is contained in 5. □ 

Theorem 7 is useful in identifying extreme points. 


Exercise 9. Show that if 5 is a compact set in a Banach space, its closed convex hull 
is compact. Is this true in every LCT space? 

NOTE. The prime-examples of-LG-Tlinear-spaees-are BanaGh-spaGes in the weak and 
weak* topologies. Other important examples are spaces of distributions. In view of 
the enourmous success of the theory of distributions in the theory of partial differ¬ 
ential equations and iii harmonic analysis, it was thought that other locally convex 
topologies niight play - a''similarly £rtrrtful roletiiratiit)pe-has-not^et-beeiT-reafeed： — 
^ ---- -Otherapplications of the--Krein-Milman theorem and- its-generalizations are de¬ 
scribed in Diestal and Uhl. 
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EXAMPLES OF CONVEX 
SETS AND THEIR 
EXTREME POINTS 


In this chapter we present a great variety of examples of convex sets, their extreme 
points, and Choquet-type integral representations of points of the set in terms of the 
extreme points. In some examples the extreme points are determined by a direct argu¬ 
ment Then a locally convex topology is introduced so that the convex set in question 
is compact and the Choquet-type representation is then derived via Choquet’s the¬ 
orem. In other examples the Choquet-type representation is derived directly by an 
analytic argument. The representation is then used to identify the extreme points of 
the set. In most of these examples the representation is unique. 


14.1 POSITIVE FUNCTIONALS 

Let Q denote a compact Hausdorff space, and C{Q) the space of continuous func¬ 
tions on Q whose values are real. We denote a linear functional t defined on C(2) 
as positive if 1(f) >0 for ail nonnegative / in C(Q). Recall from chapter 8 that a 
positive linear functional is bounded. Denoi^by. P. the coiLectioni).Lftll positive linear 
functionals t that satisfy — 

^( 1 ) = 1 . ( 1 ) 


Theorem 1. PJsji_conyex setj^vhose extreme points are the point evaluations e n . 
defined as 

erU) = Z(r), (2) 

r any point of Q. 










⑶ 



t of positive linear 
concentrated at a 


资 ft 鐵 If »】 i ” w 刺 < iwcMi ) 讁 TO 呀■■化 ks*» >ym» 


we get 之 =flti 十 （1 一 a)i2 - Ar ^ were an extreme point, i\ = t ’2 = 名， and so t 
can be represented in form (5) by the distinct measures m j and m 2 . This contradicts 
uniqueness of the representing measure. This completes the proof of theorem I. □ 
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We can use formula (5) to rewrite the formula 2 symbolically as 


j e r dm(r )， 


⑹ 


a Choquet-type representation. 


14.2 CONVEX FUNCTIONS 

In this section we make use of the notions and results of the theory of distributions 
as explained in Appendix B. 

Definition. A real-valued function / in R n is convex if it satisfies 

/ (J2 a j x j) ^ T, a jf( x j) ⑺ 

for all choices of ， •.. ，工汉 in and all a j satisfying a j > 0 , = 1 - 

Here we consider convex functions / of a single variable. It suffices to assume 
(7) to hold for iV = 2: 

f(ax + (l-a)z)< af(x) + (1 — a)f(z), 0<a<\, (8) 

for all x ， 4 . Setting ax + (i — a)z = J, condition (7) is easily seen to be equivalent 
with the following: for x < y < z t 

< fU)-f(y) (9) 

y — x 一 z — y 

It follows from (9) that every convex function is continuous and has right and left 
derivatives. 

The second difference quotients 、 

~ +~/2) -2f(x) + f{x-h) ⑽ 

converge in the sense of distributions to f n as h tends to ze ro. It follows from (9) 
that the difference quotients (10) are nonnegative; since the limit in the sense of dis- 
tnfautlons"oFa rionnegative distribution is nonnegative, it follows that for convex /， 

0<r (11) 

in the sense of distributions. 


convex 












and define the function <p x (y) by 


The function <p x is piecewise linear and 中 ’】 = 8(r — x).lf we could substitute 0 = (f) x 
in (15)，we would obtain 


fix) = / ct> x {r)f\r)dr. 


Since (j) x is not C°°, this is not legitimate, so we approximate ^> x by a sequence of 
C°° functions. Since f(r) is continuous for，• < 1， the left side of (15) tends to the 
left side of (17). On the other hand, the nonnegative .distribution /" is a nonnegative 






measure; therefore the right side of (15) tends to the right side of (17). This proves 
(17) fovx < 1. We can rewrite it using notation (13) as follows: 


f(x) = J e r {x){l - r) f"{r)dr, x < 1. (18) 

We let .r -» 1 and obtain 

= lim f (x) = / (1 -r) f"{r) dr. (18 ’） 

J 

Since / is an increasing function, and /(l) = l,mi < 1. We can combine formulas 
(18) and (18’）into one and write 

/U)= C e r {x)dm{r\ (19) 

Jo 

where 

m(r) = /* (1 — s) ds for/* < 1， and m(l) = 1 一 m!. (19 ; ) 

JO 

The measure m is uniquely determined by the convex function /; this follows readily 
from formula (19). It follows, as in section 14.1, that the only extreme points of the 
set C of convex functions satisfying conditions (12) are those where the measure m 
is concentrated at a single point of the interval [0,1]. This completes the proof of 
theorem 2. □ 

We can rewrite (19) symbolically as 

. f=f'e r dm ， (20) 

Jo 

a Choquet-type representation of convex functions. 

Exercise 硫 6 百前 heore 衍 Tf 沉说顾 x funcTiofis'of /rvariables. 

Exercise 2. Prove theorem 2 without the theory of distributions, using the theorem 
of Krein-Milman. 


14.3 COMPLETELY MONOTONE FUNCTIONS 

The difference operator D a ，acting on functions of a single variable, is defined by 

- • - 中 cr)/(f) 了 . . (21) 

For a > 0, D a maps functions f defined on IR+ into functions defined on R+. 
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Definition. K real-valued function / defined on K+ h completely monotone 
(c.m.) if 

㈠ )”的〜卜 0 

for all aj > 0 and all n = 0,1， . • •. 

The following result is due to S. Bernstein: 

Theorem 3. Every completely monotone function f on M+ can be represented as 
f(t)= f°° e~ x 'dm(X), (23) 

Jo 

m some nonnegative measure, m(R+) < oo. Conversely, evety function of form (23) 
is completely monotone. 


on M+ (22) 


Proof. To show that / of form (23) is completely monotone we write 
D a f = f D a e- >J dm(k) = j(e~ aX - \)e~ Xt dm{X)\ 
then — 

(-1.)" (fj D a; ) / = / [](1 - e- 叫 ) e_ 


dm 


is clearly nonnegative. 

Turning to the direct part we will make use of the following properties of c.m. 
functions. 


Lemma 4. 


(i) The sum of two c.m. functions is c.m. 
Suppose that f is a c.m. function; then 

(ii) f is nonnegative, 

(iii) af is CMhfora > 0. 

(iv) —D a f is cm. for a > 0. 

(v) T a f = f(t +a) is c.m. for a > 0. 

(vi) Hbf = f{bt) is c.m. for b > 0. 

(vii) f is nonincreasing. 

(viii) f is convex. 
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Proof. Parts (i) and (iii) follow from the fact that the operators D a appearing in 
condition (22) characterizing c.m. functions are linear. Part (ii) is (22)，parts (iv) and 
(v) can be deduced by applying the operators respectively to (22)，and noting 
that these operators commute with D a . Part (vi) follows by applying Hf, to (22)，and 
noting that 


= D a jy-\ Hfj. 
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.....Clearly,. . …… ..... - . 


e = (1 - e(a))f + e(a)g. " (27 ，） 

By definition of extreme point given in the previous chapter, it follows from (270 
that f = g = e.ln particular, by (27 乂 this implies that for all t and a, 

— e(0e(fl) = e(t + a)^ . . . 

All continuous solutions of this equation are exponential functions. It follows from 
part (viii) of lemma 4 that every / in K is convex, and so is continuous for r > 0. 
We.caa conclude that_ _ _ _____ ’• _ 

e(t) = e~ x, . 

That 入 is > 0 follows from part (vii) of lemma 4. 

The cases where (26") fails to hold for some g > 0 can be easily handled. When 
e(a) = 1 for some a > 0, e = eo; when e{a) = 0 for some a > 0, e = eoo. □ 

We introduce now the topology for functions that is the coarsest in which all the 
linear functionals i t \ 

W) = f(t), 0<t, (29) 

are continuous. The topology is the product topology 

n/(o. 

0<t 

According to (26), the values of f in K lie between 0 and 1. So iT is a subset of 

n[ o , u 

0 </ 

which by Tychonov’s theorem is compact. So to show K compact, it suffices to show 
that K is closed. But this is easy: for fixed a } and r, the set of / that satisfy (22) is 
clearly closed. K, being the intersection of these sets for all aj and all r > 0, is 
closed. 

We showed in lemma 5 that the extreme points of K are contained in the set { 占入}， 

0 < A. < oo defined by (25a), (25b). The set {e\}, it is easy enough to show, is closed 
and therefore contains the closure of the set of extreme points. 

We appeal now to formula (10) in theorem 5 of chapter 13. That formula with e 
given by (25) and i by (29), is precisely the desired representation formula (23). 

□ 

Some corollaries and addenda: 
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Theorem 6. 


(i) Every completely monotone function is C°°. 

(ii) The representation (23) is unique. 

(iii) Every ex defined by (25a), 0 < X < (X) is an extreme point. 


Proof. Since the measure m is > 0 and m(R+) < 00, we can differentiate (23) 
with respect to t under the integral sign; this proves (i). 

For (ii) suppose that some f in K had two distinct representations. Subtracting 
them, we get 


厂 

Jo 


e~ Xt dv(X)^0 } 


v some signed measure of finite total mass over R. The function 


_)= f e-^dvM 

Jo 

is then an analytic function continuous in the right half-plane Ref > 0 that vanishes 
on the real axis f = >. It follows that s 0, in particular, that 


F(ix) = 0 for all real r. 

F{ix) is the Fourier transformation of the measure dv. By uniqueness of the Fourier 
transform, we conclude that dv = 0, meaning no / can have two different represen¬ 
tations of form (23). 

For (iii), since K is compact, by Krein-Milman, it has at least one extreme point 
By lemma 5, the extreme points must be of the form ex given by (25). They musL 
include more than eo and eoo since the convex combinations of e\ and ^oo do not 
include all of AT. So some 入 # 0,00 is an extreme point of K. According to 
lemma 4 (vi), the operator maps K into K\ being a one-to-one linear map, /•/" 
carries every extreme point of K into an extreme point of K. So, if e\ is extreme, so 
is 


Hbex = exb. . b > 0 . 

This completes part (iii) of theorem 6. 


□ 


- -We remark4hat-an analogue af-Berastein!s-theor^ni-holds4n^-4i-m&ns-i©flal-^ae^_ 

namely for functions defined on E!|_. It also holds for.functions.on ,,, 一 : 

14.4 THEOREMS OF CARATHEODORY AND BOCHNER 


Definition, A skew-symmetric doubly infinite sequence [a n } of complex numbers: - 
a^ n = a ni GO) 


4 
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C 2 L\\&d positive definite if - ^ .. 

Y^a n -kMk o ..( 31 ) 

n,k 

)r all finite sets of complex numbers -N <n<N. 

The following result is due to Toeplitz ， Caratheodory, and Hergiotz:_ 

heorem 7. All positive definite sequences can be represented uniquely as 
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For any C 00 function 少， 

J fa d0 = h a n, (34 ; ) 

where ^ are the Fourier coefficients of xf/. It follows from (33) that the right side 
converges. We claim that a is nonnegative; to see this，take any trigonometric poly¬ 
nomial of degree N t 

N 

QN(0) = e in0 . 

一 N 

Then |^(^)l 2 = J2n y k Mk e i{n ^ k)d . Set t/t = |^v| 2 in (34’). We get an expression 
on the right that, by (31), is nonnegative: 

J a\q N (9)\ 2 dQ = Y^ a n-kMk > 0* (35) 

Let q(9) be any C°° function on S 1 ; it is easy and classical to show that q can be 
approximated by a sequence {^yv} of trigonometric polynomials in the C 00 topology. 
By definition of distribution, as N tends to oo, and (35) tends to 

j\qm 2 ade>Q, . (36) 

for all C°° functions q. Let p(6) be any C°° function that is positive on S { . Then 

m = /m 

is a C°° function; therefore (36) implies that 

J p(e)ade >0 ( 37 ) 

for any positive C°° function p. A distribution a with this property is called nonneg¬ 
ative. It is a classical result of the theory of distributions, see Appendix B, that every 
nonnegative distribution is a_nonegative measure. Thus cid6 = dm and^formula (34) 
is the desired formula (32). □ 

Theorem 7 can be extended to functions a defined on Z k , k any positive integer; 
the proof is the same. 

NOTE. Caratheodory’s own proof made use of his theorem on convex sets in finite - 
dimensional spaces. In section 14.6 we will give yet another proof, using the theory 
of positive harmonic functions. 
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Exercise 3. Denote by P the set of all positive-definite sequences normalized by 

肩 =1. 厂一 一 ~ (38f 


(a) Show that P is a convex subset of the space i°° of ail bounded sequences. 
Show that P is a compact subset of £°° in the product topology. 

(b) Show that the extreme points of P are of the form _ 


'WY 


and deduce the representation (32) using theorem 4 of chapter 13. 

An important extension of Caratheodory's theorem is due to Bochner: 

Definition. A skew-symmetric complex-valued continuous function a (5) on E: 

a{-s) =a(s), (40) 

is called positive-definite if 

Y^ a ( s J - s k)<pj^k > 0 ( 41 ) 

for all choices ofs\,...,s^ on E, and for all complex numbers 
Exercise 4. Show that condition (41) is equivalent to the requirement that 

j j a(s — ds df > Q (410 

for all continuous, complex-valued functions (/) with compact support. 


Theorem 8. Eveiy continuous positive-definite function a can be represented 
uniquely as 


a(s) = J e ias dm{o), (42) 

m a nonnegative measure on M, m(M) < 00. Conversely，every function of form (12) 
is positive-definite. 

Proof. We show first that every function of form (42) is positive definite. Setting 
(42) into the left side of (410 yields 


J J J ^ lG ^^<P(s)4>(t)ds dt dm(a) = J \4>(cx)\ 2 dm(a), 


where 在 is the Fourier transform of 沴 . Clearly, the right side is nonnegative. 





THEOREMS OF CARATHEODORY AND BOCHNER 1 

To construct the measure m for a given positive-definite function, we proceed 
in the discrete case. We deduce from (41)，analogously to (33)，that 

\a(s)\<a(0). (4 

We recall from section B.5 of Appendix B the Schwartz class of functions 5, co 





























b{a)\ir{a)\ 2 da > 0. 




，0 ° function with compact support on M. Then f = 
support, and so belongs to S. Setting 伞 2 = p into 


I b(a) p{a)da > 0 







for every nonnegative C°° function p with compact support. Such a distribution b 
is called nonnegative. According to theorem 13 of Appendix B, fe is a nonnegative 

measure: b(a) da = dm. .... -…——-… _ . .. …. 

We claim that the total mass of m is finite: • — ~ . . . . 



b{a)d.a < oo: 



*Fonergl5e~any n'orinegative C ，c0 tunction. of compact supporTlhat is equal 1 on" 
[ 一 3,1]. Define (a) = g(cr/n). Denote by /.the Fourier inverse of g. Then the 
Fourier inverse of g n is f n (s) = nf(ns). Set /" into (44): 


j b(a)g d) da = J^ a(s)n f(ns) ds. (48) 

The measure b and the function g are nonnegative, and g(a) = 1 for \a\ < 1. 
Therefore the left side of (48) is greater than 


b{a)da. 


(480 


On the other hand, according to (43). |a(^)| < ^(0). Therefore the right side of (48) 
is less than 


a(0) / n\f(ns)\ds^a(0) / \f(s)\ds, 


a quantity independent of n. This shows that the integrals (48’）are bounded indepen¬ 
dently of /z, proving (47). 

It follows from (47) that a (^) can be represented pointwnse as the Fourier transform 
ofb: 


a(s )= 



for every s, as claimed in (42). The uniqueness of a representation of form (42) 
follows from the uniqueness of the Fourier transform. □ 

Denote by P the set of positive definite functions a normalized by a(Q) = 1. It 
follows from theorem 8 that the extreme points of P are the exponentials e tGS ,a real. 
Thus (42) is seen as a Choquet-type representation of positive definite functions. 

Theorem 8 is easily extended to n dimensions; see Rudin’s book, Fourier Analysis 
on Groups. 

Laurent Schwartz has given the following extension of Bochner’s theorem. 


Definition. A skew-symmetric complex-valued tempered distribution a(s) on R is 
called positive definite if 







ci(s — t)4>{s)4>{t)dsdt > 0 


for all Cq° functions 0. 

Theorem 8’. Eveiy positive definite tempered distribution is the Fourier transform 
of a nonnegative measure of class S’ • 

Schwartz has extended his theorem to E n . 


14.5 A THEOREM OF KREIN 


Definition, Let p be a continuous real-valued even function defined on R: 


〆-，）== p(t). 

p is called evenly positive definite if 


J J p(s- t)^{s)^>{t) ds dt > 0 


for all real-valued, continuous, even functions of compact support: 

H—s) =<Ks). 


(49) 


Clearly, every even function of form (42) has this property. These functions can be 
written as 


疒 CO 

p(s) = / coscrsdm(a), dm > 0. 

«/o 

These are, however, not all. For all real 入 ， and all even, real-valued continuous func¬ 
tions (j> with compact support , 


J J coshX(^ - t)(j)(s)(l)(t)dsdt 


J e Xs </>(s)ds J e Xt <p{t)dt. 


This shows that cosh Xj. is evenly positive definite. But then so is 
p{s) = J cosh Xs dn(X), 

n any nonnegative measure for which the integral converges for all s. 


Similarly an even, real-valued function on E is called oddly positive definite if (49) 
holds for all real-valued, continuous odd functions 0. Examples of such a function 
are — cosh 入方 and superpositions. 


t8 EXAMPLES OF CONVEX SETS AND THEIR EXTREME POINTS 

M. G. Krein has proved the following result: 


Theorem 9. Every real,, even, continuous function p on W that is evenly positive- 
definite can be represented uniquely as 

.. /* OO .... /* OO . — . . — — - — 

p(s) = j cos as dm(a) -j- J cosh 入 s d/2 ( 入 ), 


m and n nonnegative measures. Similarly, every oddly positive-definite function can 
be represented as 


cos as dm(a) — I cosh ksdn(X). 


An easy consequence is 

Theorem 9' Denote by P the set of evenly positive-definite functions, normalized 
by p(0) = 1. P is a convex set, and its extreme points are 

cos as, a > 0, and cosh ks , 入 > 0. 

For a proof we refer to Krein. 

14.6 POSITIVE HARMONIC FUNCTIONS 


In chapter 11, section 11.6, it was shown that every harmonic function h defined in 
the unit disk and positive there can be represented uniquely by Poisson’s formula 


h(re ix )= 


2r cos(x 0) -+ r- 


dm(9), 


(50) 


m a nonnegative measure. 

It is easy to verify that the Poisson kernel has for r < 1 the following Fourier 
expansion: 


i r 2 oo 


Setting this into (50) gives the Fourier expansion of h: 

h(re i ^) = ^2bcr^e i£ ^, 


(51) 


(52) 


j e~ itB dm{6). 


where 


( 52 ，) 



low now how to deduce Caratheodory’s theorem，theorem 7, from I 



y, where x + iy = re lx in the unit disk. We claim that k is positive. To see this, we 
rewrite the left side of (31) by introducing n — /: = € as a new variable. We get 

>0. (530 

i n 

We want to choose [<p n ] so that for all i and r < l, x given 

= rW e itx . (54) 

n 

To satisfy (54) we multiply it by e~ lt9 and sum over €. We get 

Emr ue =jy eieie(x -' (540 

i n I 

The left side can be written as 
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__ h 1^1 < 1- .— ' _i58}_ 

Show that the convex set P defined above is compact in this topology. (Hint: Use 
Harnack’s theorem on positive harmonic functions.) 


14.7 THE HAMBURGER MOMENT PROBLEM 

A sequence of real numbers ao ， a\,... is called positive in the sense of Hankel 
(H positive) if 

5: 0 (59) 

tijc 

for all finite collection of real numbers n = 0,1,..., iV. 

Let m be a nonnegative measure on M all of whose moments are finite. 

f t 2n dm(t) < oo, « = 0,1，— (60) 

JR 

Define 

= :/ £ = 0 ， 1，•… (61) 

JR 

We claim that this sequence is H positive, for 

J2 a n+k^k = f f d ， Mt)> 0. (62) 

n、k J n，k J 

Conversely, Hamburger has proved: 

Theorem 10. Every H positive sequence {a n } can be represented in the form (61). 

For proof, we refer to chapter 33. An interesting fact is that there are H positive 
sequences that can be represented in form (61) in only one way, as well as others that 
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have several distinct representations. Why this is so will be explained in chapter 33 
on self-adjoint operators. 

Denote by Hq the set of all H positive sequences normalized by flo = 1. It follows 
from theorem 10 that every extreme point e of Hq is of the form 

e^t) = t k , k in Z+; t real. (63) 

It is not hard to show that conversely, every sequence e(t) of form (63) is an extreme 
point of Ho, Thus (61) is a Choquet-type representation of the set Hq. 

Examples of Hankel positive sequences: 

Take in (61) as 

dm(t) f f^- 1 for0<f <1,S>0 

dt — 1 0 otherwise. 

Then 

ciz = 

Thus 

㈣ 

for all real 

In conclusion, we note that theorem 10 fails to hold in more than one variable. 


t^dt. 


£ + 5 


14.8 G. BIRKHOFF’S CONJECTURE 

Definition. An n x n matrix S = (sij) is called doubly stochastic if 

(i) all entries are nonnegative, , 

Sij > 0 . ( 66 ) 

(ii) all row sums and column sums are equal to 1, 

: 一一 — —— = 1 ， J2 Si J = 1 forall/,resp.;. (660 

j 

It is obvious that the set D of all doubly stochastic matrices forms a convex set 
in IT 2 . 

A permutation p of n objects is a one-to-one map of the indices 1,..., /z onto 
themselves. The associated permutation matrix P is defined by 




% 
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一 p .._ f 1 ^J-=p(n - 

—. P,J ~ ro if / ^ Vcn. _ : 


Clearly, each tow, and each column of a permutation matrix P,-contains exactly one 
entry equal to 1 and all others are zero. This shows that each P is doubly stochastic, 
that is, P € D. We claim that each 尸 is an extreme point of the set D. To see this, 
suppose that P were the midpoint of an interval, whose endpoints 

both belong to D. Clearly, it follows from (66) that if pjj = 0, then qij = 0, and 
from (66Q that if pg == 1， then qg = 0. Since all entries of P are either 0 or 1， it 

follows that Q = 0. This'proves that P is extieme: --- - - … 

Conversely, D. Konig and G. Birkhoff have shown that all extreme points of D 
are permutation matrices P. 

Exercise 6. Prove the Konig-Birkhoflf theorem. 

By Caratheodory^ theorem, it follows that all doubly stochastic matrices are 
convex combinations of permutation matrices. This representation, however, is not 
unique in general. 

Definition. Ann x n matrix S = (sfj) is called doubly substochastic if 

(i) all entries are nonnegative, 

Sij > o. (67) 

(ii) all row sums and column sums are < 1, 

^>5 1， ^>£1 (670 

for all z, respectively j. 

We denote the set of all doubly substochastic matrices by Dq. Clearly, Do is a 
convex set that contains the set D. We call a matrix Po a subpermutation matrix if 
its entries are either 0 or 1, and if each row and column contains at most a single 
entry 1. Every Po belongs to Do- The argument used above to show that every P is 
an extreme point of D can be used to prove that every Pq is an extreme point of Do. 
Conversely, 

Exercise 7. Show that all extreme points of Dq are subpermutation matrices Pq. 


We turn now to infinite matrices S = (^y), /. j in Z+. The notions of doubly 
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fined exactly as in the n x n case. We denote by X the linear space of all matrices S 

with real entries whose rows and columns have uniformly bounded norms: 

sup ^1^-1 < oo, sup^]|j/y| < 00. (68) 

1 J j i 

We deal first with doubly substochastic matrices. As topology we use the coars¬ 
est topology in which all the linear functionals that map the matrix S into its O'th 
component are continuous: 

t u (S)^s u . (69) 

We recall the following result from chapter 13, exercise 3: the only linear func¬ 
tionals in X that are continuous are finite linear combinations of the 


Theorem 11. 


(i) The extreme points of the convex set Do consisting of all doubly substochastic 
matrices is the set {Pol of subpermutation matrices. 

(ii) Dq is the closure of the convex hull of [Pq] in the topology induced by the 
functionals (69). 

Proof. We first prove part (ii). Suppose, on the contrary, that some doubly sub¬ 
stochastic matrix Z does not belong to the closure of the convex hull of {Pol* Ac¬ 
cording to theorem l! of chapter 13, there would exist a continuous linear functional 
i such that 


HZ) > c t (70) 

but for all T in the closure of the convex hull of {Po)» 

i(J.) < c. (71) 

We may, in particular, 作了 

t(Po) < C. (710 

The only linear functionals i that are continuous in the coarsest topology that makes 

s tHFfuH56dWre}7mT69)T6n'tihuourarFffirfin 

卜 [〜•£". . . (72) 

ij<n 


Denote-by the projection-of-any~substoehastie-matTix~-5-onto-the-n x-n matrix 
formed by the intersection of the first n rows and columns of S. Clearly, S n is a 
doubly substochastic (n x n) matrix. Then it follows from (69) and (72) that for 
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any 5, 


£(S) = US t1 ). .(73) 

Denote the projection of Z by Z n . As remarked earlier, the extreme points of the 
set of doubly substochastic n x n matrices are the /? x ?? subpermutation matrices. 

It follows from Carath6odory’s-theorem that on a compact convex set, a-continuous- 
linear functional takes.its maxinmnuMie of_the_e;ctreme..points^_We.iiay.eJhat_ 

^CZ^^sup^), (74) 

Pn 

where the P n are n x n subpermutation matrices. Such a P n is the projection of a ' 
subpermutation matrix Pq of infinite order whose elements not in the first n rows and 
columns are set = 0. By (73), 

l(P n ) = i(P 0 ) and €(Z n ) = €(Z). (75) 

Combining (74) with (75), we obtain 

^(Z)< snpi(P 0 ). 

Pq 

This combined with (710 shows that (70) cannot hold. Therefore every Z in Dq 
belongs to the closure of the convex hull of {Pol- 
To show that, conversely, all points of the closure of the convex hull of {Pol belong 
to Dq, we rewrite the criterion (67) and (670 for belonging to Z>o as follows: 

~ ⑹ >0 ， （ 76) 

^ 1 - < 1, (760 

}<n i<n 

for all positive integers n. Since by definition of the topology, the functionals (76 )， 
(76 / ) are continuous, and since these inequalities hold on the convex hull of {Po}» 
follows that they hold on its closure. This completes the proof of part (ii). 

The proof of part (i) is based on theorem 6 of chapter 13, which says that the 
extreme points of the closure of the convex hull of a set S belong to S, provided both 
sets are compact. In order to apply that theorem to 5 = [Pq], we have to verify that 
both Do and {Pol are compact sets. To see this, we note that the topology we have 
imposed is the weak product topology 

Yl s >j- 

The entries of S in Dq lie in [0,1]，so Do is a subset of 

n[。，u 
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According to Tychonov’s theorem, this is a compact set. We have shown already ii 
part (ii) that Dq is a closed set; being a closed subset of a compact set makes D, 
compact. 

Similarly, in order to show that {Pol is compact, it suffices to show that this set i 
closed. The matrices Po are characterized by the inequalities (760 and 

Each of these sets is closed; therefore so is their intersection {Pol- 
We now appeal to theorem 6 of chapter 13. It states that given a compact set sue: 
as (Po) whose closed convex hull 一 which by part (ii) of theorem 11 is Do — is als' 
compact, then all extreme points of the closed convex hull belong to the originE 
comnact set. This completes the oroof of oart (i) of theorem 1 L 「 






V t where tij are defined by (69) and 


^i(S) = Y1 S U^ ^ W - 












part (ii) cannot be proved by appealing to the Krein-Milman theorem, since the set 
D is not closed, and therefore not compact. 

Theorem 12 was conjectured by Garrett Birkhoff. The preceding theorems and 
proofs are due to Kiefer and Kendall; see D. G. Kendall. □ 

1 涵兩 § t^Torem 

• 狂 bility~theory1^"Sp 此 e whichfa-allgebr 茲 2 ： is specified. 

The sets in S represent all possible events; a probability, measure on S represents 
the probability of their occurrence. An infinite sequence of occurrences is modeled 
by the direct product Z x Q. The events in Z x QJojm the_smallest.cr-algebra 1hat_ 
contains all the cylinder sets, formed by the groduct sets 

n 。， 

where Ej belongs to the cr-algebra S in and all but a finite number of the sets Ej 

are the whole space Q. 

A probability measure m on the cylinder sets of Z x is called invariant under 
permutations if for all cylinder sets 

m (n~) =;n (n 〜 w); 

where j p(j) is a permutation of the indices, such that p(J)-=j for all but a 
finite number of j. 

The set of probability measures onZxQ that are invariant under permutations is 
clearly a convex set, where convex combinations of measure is defined in the obvious 
way. 

A probability measure m on the a-algebra E in Q induces the product measure 
on the cylinder sets of Z x ^ by the formula 

.w(n~)=n ,n(£ )). — 

Since all but a finite number of the Ej are equal to Q, all but a finite number of the 
factors on the right are equal to 1. Clearly, the product measure on Z x ^ induced by 
a measure on Q is invariant under permutation. 

De Finetti proved the following important result: 

Theorem 13. 

(i) The extreme points of the set of permutation invariant probability measures on 
Z x Q are the product measures. 

(ii) Each measure on Z x Q invariant under permutation can be expressed in a 
unique fashion as an integral over the product measures. 

Clearly this is a Choquet type of result. For a proof, see de Finetti or any advanced 
text on probability. 
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14.10 MEASURE-PRESERVING MAPPINGS 

In this section Q denotes a compact metric space, T a homeomorphism of Q, onto Q. 
It can be shown that there exists at least one probability measure on the Borel subsets 
0 f q that is invariant under T. There may be many. The collection of all invariant 
probability measures forma convex set. The following result of John Oxtoby sheds 
liiiht on the structure of this collection: 

Theorem 14. 

(I) The extreme points of the convex set of probability measures invariant under 
T are those measures with respect to which T is ergodic. 


(ii) Eve) 













THEIR EXTREME POINTS 


m am i + (1 —a)m 2 ： 0 < a < ^ m 2 ： 

WeSrsOake the^aselhaf mi. is.absolutely continuous with respect to m 2 . By the 
- Radon-Nikodym* theorem, 

mj == /mo, _ f nonnegative and in L 1 (m 2 ) - — 

^'^Sincei)oth wi-^d-77r2^are'mvariant r under'^^^^ is */-. Since m\ 丰 爪 2 , _ 1; 

- therefore there exists a po^itive-tmmb&r-e sueh-thaHhe-sets 命 - =4<w|-/ ⑽— >-.(：}• and 

Q 2 = [a)\f(co) < c], both having positive mo measure. Since / is invariant under 
r, r maps Q\ and Q 2 onto themselves. 

Substituting m\ = fmo into the expression for m as convex combination of m\ 
- and m 2 gives 

m = [af + 1 — a]ni 2 - 

It follows that and Q 2 have positive m measures; this shows that T is not ergodic 


The case whenmj is not absolutely continuous with respect to m 2 is just as simple. 
Then there exist sets E whose m 2 measure is zero, but mj (E) > 0. Denote by s the 
quantity 

s = sup m j (£)， nio{E) = 0. 

Let E n be a maximizing sequence: 


limmi(£„) = moiEn) = 0. 


F is invariant under T ; for if not, the set (F U T F) would have m \ measure greater 
than m(F) = s but m 2 measure 0 contrary to the definition of s. 

We claim that in.the decomposition = F U F c , both pieces have positive 777- 
measure. For, using the expression of m as linear combination of m\ and m 2 , we 
get ， 

m(F) > am\(F) = as 


t(F c )>(\-a)m 2 (F c ) = 0-a). 


This shows that T is not ergodic with respect to m. 
For the proof of part (ii), see the article by Oxtoby. 


NOTE. In their work on X-ray crystallography, honored by the Nobel Prize in physics 
in 1986, H. Hauptman and J. Karle made crucial use of Toeplitz’s characterization 
(32) of the Fourier coefficients of positive measures. 









HISTORICAL NOTE. Denes Konig (1884-1944), professor at the Technical Univer¬ 
sity in Budapest, was the founding father of graph theory. He developed many of 
the basic concepts，and wrote the first book on the subject in 1936. His proof of the 
Birkhoff-Konig theorem is graph theoretical. The brilliant Hungarian school in graph 
theory is his legacy. 

Konig supervised the Eotvos mathematical competitions for high school students. 
He was extremely kind and encouraging to budding young mathematicians, includ¬ 
ing the writer of these pages. 

When the German army occupied Hungary in 1944, putting Hungarian Nazis in 
power, Konig saw what was coming and threw himself out the window of his apart¬ 
ment. 
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BOUNDED LINEAR MAPS 


In chapter 2 we studied some rudimentary properties of linear maps M of one linear 
space into another. Here we impose topological structures on the linear spaces and 
on the mappings themselves. Alternative names for maps, and synonymous with it, 
are operator and transformation. 

15.1 BOUNDEDNESS AND CONTINUITY 

Definition. X and f/ area pair of Banach spaces. A linear map (actually any map) 
M:X^U 

is called continuous if it maps convergent sequences into convergent ones, that is, if 

x n — x implies 3VLc, 2 — Mx. (1) 

Here convergence is reckoned in the sense of the norm in X and U, respectively. 

Definition. A linear map M : X U of one Banach space X into another U is 
called bounded if there is a constant c such that for all xmX 

|Mx|< c|x|. (2) 

Theorem 1. A linear map M : X U of one Banach space X into another U is 
continuous if and only if it is bounded. 

Proof. It is easy to show that a bounded linear map is continuous, even Lipschitz 
continuous. 

Conversely, if M were not bounded, (2) fails for any c, say n, for some x, say x n : 
|Ma- /7 | > n\x n \. 


Normalize x n so that |jc w | = 1 /y/n\ x n tends to zero but Mx n does not. Clearly, (1) 
is violated, so M is not continuous. □ 
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Suppose that the spaces X and U on which and into which M acts are merely 
normed linear spaces, not complete, and suppose that M is bounded in the sense of 
(2). Then M can be extended by continuity to a bounded mapping of the comple- 
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We have shown there, in theorem 1, that if N is closed, as is, then the quantity 
IWI is a norm. Using the definition (3) of the norm of a map，and some obvious 
manipulations, we have 


We turn to defining the transpose of a bounded linear map M : X U, X and 
U normed linear spaces. Let ^ be a point of U\ the dual of that is, i is a bounded 
linear functional on U. The composite 名 (Mjc) is a linear and bounded functional of jc: 

^(IVLv) = fCr). (7) 

The linear functional 与 € X’ clearly depends linearly on i\ 

^ = M f L (70 

M’ :U f ^ X 1 is called the transpose of M. 

The transpose of a bounded linear map is the infinite-dimensional generalization 
of the transpose of a matrix; it is an enormously useful concept. In studying and 
using the transpose，it is convenient to denote the action of the linear functionals by 
parentheses as follows: 

m = («,= 

where u e U, l e U\ x e e X\ In this notation the relations (7)，（70 defining 
the transpose can be rewritten as 

(NLy, i) = (.r, M ; t). (8) 

We recall from chapter 8, (see theorem 7’）the definition of the annihilator R 1 - of 
a subspace of a normed linear space U as the subspace R 1 - of U 1 consisting of all 
bounded linear functionals i that vanish 1 on /?• Similarly, for any subset 5 of X\ we 
define 丄 as the subset of those vectorslF^ffi 心 re annlEnSStf fjy^^ vectoff in"*" 
S. Clearly, S 1 is a closed linear subspace of X. The basic properties of transposition 
are summarized in 


Theorem 5. __ _ __ 

…… (i) The transpose M’ of a bounded linear map M is bounded, ancl 

|M，| = _• (9) 

(ii) The hiillspdce ofM^is •tlre~mnVftltmrofth-ercuTg-e~vf : M7 
= -^M- 


( 10 ) 













|M'| = sup sup |(x,M^)| = sup |(Mjc,£)|. 

W=i 1 ^=】 |«|=l=|.v| 

According to theorem 6 of chapter 8, for every u in U, 


\u\ = max \{u,t)\. 
|€|=i 












STRONG AN 】 


Exercise 3. Denote by I the identity map X — X. Show that I’ is the identity map: 
X， — X\ 

In a complex Hilbert space the notion of transpose is replaced by adjoint, defined 
by the analogue of (8) and denoted by an asterisk: 

(Mx ，30 = (;c ， M，). 

For matrices, the adjoint is the conjugate transpose. 

Exercise 4. Show that theorem 5 is valid for the adjoint operation. 


15.2 STRONG AND WEAK TOPOLOGIES 

The norm of linear maps X — U defines, a metric topology in £(X, U) that ii 
sometimes called the uniform topology, in deference to two other topologies that an 
also useful and therefore important: 

Definition- The strong topology in U) is the weakest topology in which al 
functions C-^U of the form M — ^ IVLc are continuous, 又 being any point of X. 

Definition. The weak topology in £(X, U) is the weakest topology in which all lin 
ear functionals of the form M — ^ (IVLx, t) are continuous, x being any point of } 
and i any point in U ; . 

Exercise 5. Define the weak* topology in £(X, U% X， U Banach spaces. Sho\ 
that there is a natural one-to-one correspondence between £(X, U r ) and C(U y 
and that this correspondence is continuous in the weak* topology. . 

Of equal importance are the corresponding notions of sequential convergence: 

Definition. A sequence of bounded linear maps: X — U 、 X, U Banac 
spaces, is called strongly convergent if 

s - lim (l- 


exists for every x in X. 


{M ；I } is called weakly convergent if 


exists for all j in X. 


— lim M". 
n-^oo 


(15 
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It is easy to show, and is left as an exercise, that a strongly or weakly convergent 

sequence of maps has a limit M, in the sense thartt5~) ，（ 15’) are equal tcrMbn-We - - 

will denote these relations as — limM„ = M and iu - limM；, ..== .M . —— .= 」 . .- 

Exercise 6. Prove that if — lim = M，then w — lim = M 7 provided that X 
is reflexive. (Hint: Use the definition of weak convergence; see (18) below.) 

No such^esult holds for^trong convergence;^ake ^ md-fcH30th-to-be-theflilbeFt - 

space £ 2 (see chapter 6) consisting of vectors 

丨 =•••), ll^ll 2 = . . . -- 

Define M ；I to be k 

M/jX = 0,0, • • •)• 

It is easy to see that s - limM n = 0. Since i 2 is a Hilbert space, it is self-dual; 
take l = (b],b 2 ,., the relation (M w ^, i) = a n b\ = (x, M f n l) shows that M! n i = 

(0,. • • ，办 i ， .. .）• Clearly, s — limM^£ does not exist unless b\ = 0. 

The significance of these notions is that maps of interest are often ― one is tempted 
to say usually 一 constructed as limits, uniform, strong, or weak of sequences of ap¬ 
proximate maps. The following result, as important as it is trivial, is used all the 
time: - 

Theorem 6. Let X， U be Banach spaces, M 7I a sequence of linear maps: X U t 
uniformly bounded in norm: 

|M« I < c for all n. (16) 

Suppose further that 

s — lim M. n x 

exists for a dense set of x in X. Then {M H } converges strongly, Le. the s-limit exists 
for all x in X. 

Exercise 7. 

(a) Prove theorem 6. 

(b) Formulate and prove an analogous theorem for weak convergence. 


15.3 PRINCIPLE OF UNIFORM BOUNDEDNESS 

Uniform boundedness turns out to be not only convenient for proving strong or weak 
convergence, but it is necessary as well. 





COMPOSITION 


Theorem 7. Let X and U be Banach spaces ， {M v } a collection of bounded maps 
X U t such that for each x in X and each i in U f t (M v .x, 0 Is bounded by a 
constant that only depends on x and i: 

l(M y x,£)| <c(.v，0 for all M v . (17) 

Conclusion: {M v } is uniformly bounded, meaning that (16) holds. 

Proof. We appeal to the principle of uniform boundedness, theorem 4 of chap¬ 
ter 10: If [u v ) is a collection of points in a normed linear space U such that for every 
linear function l in U\ \t(u v )\ < c(l) for all u v ，then there is a constant c such that 
\u v \ < c. We apply this result to u v = Mj，and conclude that for all v, 

|M u .r| < c(x). (17，) 

Next we appeal to theorem 2 of chapter 10: If {f v } is a collection of real-valued, 
continuous, subadditive, and positive homogeneous functions defined on a Banach 
space X, and if at each point x of X, < c(.r) for all v, then there is a number 
c such that 


/vU) < c\x\ for all .r, all v. 

We identify the functions f v with f v (x) = Clearly the f v are homoge¬ 
neous, subadditive, and continuous. According to (17’) above, the f v (x) are bounded 
at every point. Therefore the / v are uniformly bounded, which in our case means that 
|M„x| < c\x\ for all x in X, as asserted in (16). □ 

The relation w — limM/j = M means that (150 holds for every x in X, which in 
turn means (see definition 1 of chapter 10) that 

lim (Mnx, t) = (3VLv, l) (18) 

n-^oo 

for all i in (J ; and all x in X. Since a convergent sequence is bounded, it follows that 
condition (.17) ofJ±Le.orem_7_is.satisfied;Jhexefore_b-y.theorem„7,..the_sequence is 
uniformly bounded. 

Corollary 7’. A weakly convergent sequence of maps of one Banach space into an- 

.. . 


15.4 COMPOSITION OF BOUNDED MAPS 


We turn now to the discussion of composition, called the product, of a map M: X 

r r _i 曹 _ xt . r r xir nr>i_ •_____ i; _ _i ; _ _ ^ r __ __ ^ 
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Theorem 8. Let X, £/, W denote Banach spaces, M andN bounded linear maps, 
M:X^U, N:U-^W. 

Then the composite NM is a bounded linear map: X — W with the following prop- 
.erties: • • 

(i) Submultiplicativity, |NM| < |N| |M|. 

— 否厂 (丽厂 = w : …’ . 

Proof. Applying inequality (4) twice, we get 

..…- 闹 . <-j N( 雨 f. 

Applying definition (3), we get 

|NM| = sup < |N| |M|. (19) 

We turn to (ii): applying (8) twice, we get 

(NMx,m) = (Mjc, N r fn) = (x, (20) 

□ 


Exercise 8. Prove that multiplication of maps is a continuous operation in the strong 
topology on the unit balls of C(X, U) and £(t/, W). 

Definition. Two maps A and M of a linear space X into itself are said to commute if 
AM = MA. ^ 

Exercise 9. Let X denote a Banach space, A a bounded map: X — X that com¬ 
mutes with each of a collection {M v } of bounded maps X —>■ X. Show that then A 
commutes with every map M that lies in the closed linear span of the set of maps 
{M u ) in the weak topology. 

Exercise 10. Show that in a complex Hilbert space (NM)* = M*N*. 


15.5 THE OPEN MAPPING PRINCIPLE 

The next group of results, the open mapping principle, and the closed graph theorem, 
goes considerably deeper than the foregoing material. These ideas are due to Stefan 
Banach; their validity is far from being intuitively clear at first glance, or even a 
second one. 

Theorem 9. X and U are Banach spaces, and M ' X — U a bounded linear 
mapping ofX onto all ofU. Then there is ad > 0 such that the image of the open 
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unit ball in X under M contains the ball of radius d in U: 

MB v (0) D B d (0). 

Proof. Denote by B n the open ball of radius n.around the origin 
XovU. Since M is assumed to mapX onto U ， and since the unir” 
of X, it follows that UMB,! = t/. Since the Banach space U is 
from the Baire category principle that at least one of the sets M 

open set. Some translate of this set is dense in some ball around 

range of M is all of U, by linearity of M we take that transla 
M(B n — .ro). The set B n - xq is contained in the ball of radius 
origin. So by homogeneity of M，we conclude that MBi(O) ir 
some ;• > 0. Consequently for any c > 0， 

M5 C (0) is dense in B cr (0). 

We want to show now that any point u in B r (0) is the image « 

⑼： 

Mx — u. 

This point-x in Bo(0) is constructed as an infinite series 

OP 

x = Y i x j- 


u roiiows rro 
choose x m to 


It follows from (22), 
We noted in chapj 
norms \xj\ of ase 
converges stro 


.geo 











. . . . -1x I < ^ I < > : j~ = 2. (25) 

Since M is a bounded map, letting m — oo in (24c), we conclude that Mx = 
^Mxj=u ： - .. … □ 

—— Theorem 9 has a - 勝础抬 F&stkig-aa dj g ip Qrtarit-GQnsequericesHhe first one 
is the open mapping principle: 

Theorem 10. X and U- are Banach spaces, M.. X — U a bounded linear map onto 
all of U. Thenl^l maps open sets onto open sets. 

This is an immediate corollary of theorem 9. □ 

Theorem 11. X and U are Banach spaces, M:X U a bounded linear map that 
carries X one-to-one onto U• Then the algebraic inverse of M is a bounded linear 
map of U — X. 
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as x = y + z, y in Y, z in Z. Denote the two components y and z of x by 
—• 寸 ~ --- 

(i) Py and T?z are linear maps of X on Y and Z, respectively. 

(ii) =P y , P| =P z ;P y P z =0. 

(iii) Py and Pz are continuous. 


— P/to/—P arts-® - and 彻 prove part (ii!) we observe that since F 

and Z are closed, and the decomposition is unique，it follows, that the graphs of Py 

and Pz are closed. The closed graph theorem does the rest. □ 


A"map satisfyihgT 2 =^Tls^called & prvjeciioii. 一 : 

We conclude this chapter by observing that complete metric spaces have proper 
subsets, called sets of second category, that are not unions of a denumerable number 

of nowhere dense sets. This allows a sharpening of the open mapping principle: 

Theorem 16. X and U are Banach spaces, M: X —U a bounded linear map 
whose range is a subset ofU of second category. Then the range ofM is all 
ofU. i ' 


Exercise 13. Prove theorem 16. 

HISTORICAL NOTE. Stefan Banach (1892-1945)，a Polish mathematician, was one 
of the founding fathers of functional analysis. Banach spaces are named in recogni¬ 

tion of his numerous and deep contributions, and for having written the first mono¬ 
graph on the subject (1932). He was the inspiration of the brilliant Polish school of 
functional analysis. 

During the Second World War, Banach was one of a group of people whose bodies 
were used by the Nazi occupiers of Poland to breed lice, in an attempt to extract an 
anti-typhoid serum. He died shortly after the conclusion of the war. 

The Nazi attitude toward Poles is epitomized by the following story. After the 
conquest of France in 1940, when Hitler ruled most of Europe, a leading German 
mathematician, a member of the Nazi party, called on Elie Cartan, the dean of French 
mathematicians, to discuss the organization of mathematical life in the new European 
order. Cartan wanted to know how the Polish mathematicians would fit in. “Oh，” the 
German replied, “the Fiihrer has declared the Poles to be subhuman.” 
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EXAMPLES OF BOUNDED 
LINEAR MAPS 


An important class of linear maps is furnished by integral operators. The first part 
of this chapter is devoted to investigating their boundedness in various norms. Let T 
and S be Hausdorff spaces, equipped with measures n and m. K denotes an integral 
operator mapping complex-valued functions f on T into complex-valued functions 
8 on S: 

g(s) = (Kf)(s) = J^K(s t t) f(t) dn(t). (1) 

The complex-valued function K(s, t) is called the kernel of K; /, K are assumed 
to be measurable and restricted so that (1) defines a measurable function g. Each 
subsequent theorem reveals a natural class for /, K, and g. We recall from chapter 4 
the L^-norms: 

\f\LP = (^J T \f{t)\P dn^y . 1 < /? < OO. 

The space LP{T,n) is the completion of the space C 0 (T) in the L^-norm. The space 
LP(S,m) is defined analogously. The space L°° is the space of essentially bounded, 
measurable functions. 


16.llBOUNDiD~&i ： SS—C^iN^ESl^irOPERATORS 


We start with conditions that guarantee that (1) is a bounded map ffbm'L 1 ^,«) or 
L°°(7» to 乙 1 OS, m) or m). 

Theorem 1. 


(0 The map K defined by (1) is bounded as a mapping L 1 L°°, and 
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\K\<sup\K(s7t)l 

_ SJ 


' … provided'that the quantity on the right is < oo. 

(ii) K is bounded as a mapping L 00 L 1 , and 


ITrl J 

f, 


IK '-J 

J 



provided that the quantity on the right is < oo. 

(iii) Kis bounded as a mapping L 00 — L 00 , 


|K| < sup j^\K(s,t)\dn(t) 


if the quantity on the right is < oo. 

(iv) K is bounded as a mapping L 1 - » L 1 , and 

|K| < sup J t)\dm(s), 

if the quantity on the right is < oo. 

Proof. By (1), for any s in »S, 

|g(^)l < f T \K(s,t)\\f(t)\dn{t). 

The right side is < sup, |^(j, f)l l/1/.i, so 

Igjpp = sup |g(j)| < sup |AT(j, 01 l/li,' _ 


( 2 ,) 


( 2 //) 


( 2 ///) 


( 2 /„) 


⑶ 


This proves (2, ). 

Integrate (3) with respect to dm over S: 

|g| L , = J^\g(s)\dm(s) < J J 011/ ⑴ I 如⑴ 必”⑴ 

= J[f \K(s, t)\dm(s)\\f(t)\ dn(t). ⑷ 

The right side is 

< J J \K(s,t)\dm(s)dn(t) |/|oo ； 

this proves (2"). 
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The right side of (4) is also less than 

sup J \K(s,t)\dm(s)\f\ L r, 

this proves ( 2 / v ). 

The right side of (3) is less than 

j T \K(s, t)\dn(t)\f\ L co- 

this combined with (3) proves (2///). 口 

Note that when K(s y t) and f{t) are both positive, the sign of equality holds in 
both (3) and ⑷. From this it is not hard to deduce 

Corollary 2\ When the kernel K(s t t) in (1) is nonnegative, the sign of equality 
holds in (2///) and (2/ y ). 

7 The transpose’of K is easily written down; denote by ( , )5 and ( , ) T the standard 
L 2 scalar product on S and T with respect to dm and cln, respectively: 

(g ， h)S = j g(s)h(s)dm(s), ( 5 ) 

d f)T = j 丁 k(f) f(t) dn(t). (5 ’） 

Multiplying ⑴ by h(s) and integrating gives 

(K/, h) s =： JJ K(s, t)f(t)h(s) dn{t) dm{s) = (/, K'h) T , ⑹ 

ST 

where 

(K’/z)(/) =z J^K(s, t)h(s) dm(s). ( 6 ) 


In words ； thrkernel of thrfrmi^ose ^ - — 

of-the .variables s and r interchanged. 

We recall now theorem 5 of chapter 15, according to which the norm of IC equals 
the norm of K. We verify this in the case when the kernel K is nonnegative, and K 
is regarded as mapping L X {T) into L'(5). According to corollary 1', |K| is given 

by formula ( 2 ,„). K ， ,—on.the.—-other. - 

given by formula (2///), with the roles of s and f reversed. Clearly, |K'| = |K|, as it 
should be. 




We turn now to the L 2 -norms; -which we denote as || ||; the corresponding norm 
of K is denoted as || K || • ■ _ : _ 


Theorem 2. The map K defined by (1) isbowidedhs a map: L 2 X 2 ; and 

m 2 <Jf \K 2 (s,t)\dmdn, (7) 


provided that the quantity on the right is < oo. 


— Proof. Applying the Schwarz inequality (see chapter 6) to the integral on the right 一 
in 0), we get.. .. — - - - 


\g(s)\ 2 < j 严 2 (s,_n J^\f(t)\ 2 dn. 
Integrating both sides dm gives 

llgll 2 <JJ \K(s,t)\ 2 dndm\\f\\ 2 , 


as asserted in (7). — - □ 

Inequality (7) is due to Hilbert and E. Schmidt. Another criterion has been given 
by Holmgren: 

Theorem 3. Kas defined by (1) is bounded: L 2 L 2 , and 


||K|| 5 (sup/|A ： Cy,/)|£^) (sup/I 尤 Cm)I 加 ) ' ⑻ 

provided that the quantity on the right is < oo. 

Proof. According to theorem 1 in chapter 6, 

(9) 

We are going to use (9) to estimate g = K/. By (6), 

(g ， h)s = j j K(s, t) f(t) h(s) dndm. (10) 


、 l/2 


㈣ 卜 ㈣ 急 


For any three positive numbers /, h and c, fh < cf 2 /2 + h 2 /2c, so the right side of 
(10) is 
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< J J l^^.0l{^l/(f)l 2 + ^|/i(j)| 2 J dmdn. 

In the first term we integrate first with respect to in the second with respect to t. 
We get the estimate 

^sup J 1^(5, f)l dm ll/ll 2 + 1-SUP J 01 dn\\hf. (10’) 

We take now ||/|| = 1 = ||/j|| and choose c so that (10') is as small as possible. This 
minimum is 

(sup/) 1/2 (sup/) 1/2 . (10^) 

Combining ⑼ and (10) with (10〃)，we conclude that for ||/|| = 1, ||K/|| is < the 
quantity in (10"). In view of the definition ||K|| = sup ||K/||, ||/|| = 1, this proves 
⑻. □ 


16.2 THE CONVEXITY THEOREM OF MARCEL RIESZ 


The two factors appearing on the right in (8) are the square roots of the quantities 
appearing on the right in (2 m ) and (2,- u ). We noted in corollary 1， that for a positive 
kernel these quantities are not merely upper bounds for the norm of K : L 00 L°° 

and L 1 -> L 1 but equal to these norms. 


Definition. Denote by M(p, q) the norm of 

K : L P (T, n) — >■ l/ 1 (S ， m) • (11) 

For integral operators whose kernel is > 0, we can restate inequality (8) as follows: 
M(2, 2) < 1) M l / 2 (oo,oo). 

This turns out tobe a special caLejf.aJk.moxe_generaLtheor.enLdue.to M_.Riesz..... 


Theorem 4. Let M be a linear map of complex-valued, functions defined on T into 
complex-valuedfunctions defined on S. Suppose thatM carries functions measurable 





LW(T ， n) — L qo (S, m) and LP' (T, n) ^ L clt (5, m). 
Conclusion: then M is a bounded map ofL p ^(T, n) -> m ) w here 



0 < a < 1. 


(12) 






(120 


Fiirthennore, M(p\ q) & h t?/z • 炫汾戈咐 左 : 

M(p(a),q(a)) S M ] ^ a (p 0i g 0 ) M a {p\,q\)\ 

here M(p, q) is the norm of the operator K defined in (11 ),. 

Proof. We sketch Thorin’s beautiful proof of this theorem. The starting point is 
the following result due to Hadamard: 

Three Lines Theorem. Let be a bounded analytic function in the strip 0 < 
Re ^ < 1. Denote 

NM = snp[f(a + iri)\ 

Then 

N(a)<N ] ^ a (0)N a (l). (130 

Proof. Set c — log N(0)/N(l);by (13)，the function 0(0 代 is in absolute value 
< A^(0) for Ref = 0 and Ref = 1. So, by the maximum principle applied in the 
strip 0 < Re f < 1, 

|^(a + irj)\e ca < N(0)\ 

from this and the definition of c, (130 follows. □ 

We turn now to the mapping M; by definition of the norm, 

M(p,q) = sup \Mf \ Ltf . 

\f\ L P^ 

Furthermore, according to theorem 5 of chapter 5—Holder’s inequality and equality— 
for any g in L q 、 \g\ig = sup^i^^, \(g, /?)^|, where q' is dual to q. 

Combining the last two, with g = M/, we get 

mp ， q)= sup |(M/,/a)|. (14) 

We take p = p(a) 9 q = q{a) as defined in (12). The complex-valued functions / and 
h can be factored as / = \f\e ifl , h = \h\e iv . For any f in the strip 0 < Re f < 1 
we define 


/(O = h(X) = 


(15) 
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where p(a )， p(f)，and so on, are defined by formula (12). Note that f{a) = /, 
_ = h. Since l/p(f) and l/q(() are linear functions off, so is \/q f {0- There¬ 
fore /(f) and ) are analytic functions of f, and so is 

= (M/(f),/j(f)) s = J (150 

Lemma 5. Let f and h be functions of unit norm, \f\ p{a ) = 1 \h\ q i{a) = l, and 
9(0 defined as above. Define N{a、as the supremum of\(p(^)\ on the line Re f = ci; 
we claim that 


"(0) £ Mipo^qo), N(l) $ M{p\,q\). 
Proof. Let’s take Ref =0. Then f = /”， and so by formula (12 )， 


/? ⑷ p{a) . q’(a) q r [ci). 

i +imag ” — = 


p(0 


From (15), 


i/_A = i/C i_)i 》= 叫 


- l;,l?’( a ) 


(16) 


(17) 


(18) 


Since / and h were chosen so that \f\ LPitt) = 1, \h\ Lq > la) = 1, it follows from (18) 
that \f{ir])\ Ln — 1, \h{ir})\^ — 1 . Therefore, 


\Mf(ir])\ Lm < M(p Q ,q Q ). 


(19) 


Estimate 诊 defined in (15’）by Holder's inequality; using (19), and \h(iri)\ ，- = 1 , 
we get L<l ° ' 

| 0 (/? 7 )| = |(M/(i'^),/j(/? 7 )| < |(M)/(/? 7 )| iTO |/z(i •切 $ M(p 0 ,q 0 ). 

This proves the first exactly,the same fashion. 


We apply now the three lines theorem, (13’)，to ip defined by (15'); using (16 )， we 
-get___ 

… .. ( 20 ) 

Since /⑷ = /， h(a) = A, by (15’) 

― _ _ 0(^0 =_ (M/ t h). __ 

According to (14). the supremum of the right side over all /， /z of unit norm is the 
norm of M: 
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Mipiajlqia)) = sup..|0(a)|. 

Using the estimate (20) for the right side, we obtain the desired inequality (120. … □ 
16.3 EXAMPLES OF BOUNDED INTEGRAL OPERATORS 


• —_-Theorems,! and 3 _both_fumish criteria for an integral operator to be bounded is 
L 2 L 2 . These criteria are very far from being necessary for boundedness, and are 

insufficient for proving the L 2 Lr boundedness of the most important and most 
beloved mappings. We illustrate this on a number of exmnples.... 


16.3.1 The Fourier Transform 
The Fourier transform is defined by 

(F/) ⑷ =7 ⑴去， （21) 

whose kernel is 

K{s, t) = ^=e - is \ ( 21 y ) 

r = S = E，m and « Lebesgue measure. 

Clearly, for this K the right side of both (7) and ( 8 ) is oo, so neither theorem 2 nor 
theorem 3 can be used to show the L 2 L 2 boundedness of the Fourier transform. 
Yet it is well known that it is bounded; see theorem 21 of Appendix B..On the other 
hand, we can use part ⑴ of theorem 1 to conclude that F : L 1 L°° is bounded 
by 1 /V^tF. We can now appeal to the M. Riesz convexity theorem, theorem 4, with 
(po,qo) = ( 2 , 2 ), (pi, q\) = ( 1 , oo) to conclude, after a brief calculation. 

Theorem 6. For \ < p <2 } ¥ is a bounded map ofL p L p ’( p - and 
/ i \ & 一 pVp 

• (22) 

This inequality is called the Hausdorff-Young inequality after its discoverers. 

16.3.2 The BQIbert Transform 

Let h(t) be a real-valued function on R, fairly smooth 一 C 1 will do — and tending to 
zero as \t\ -» oo at a reasonable rate, say 0 (r"~ 2 ). 

The Cauchy integral 




( 23 ) 




















J J 


as asserted in theorem 7. 
Exercise 1. Show that 


H 2 = —I, where I = identity. 


'(27) 


(Hint: Consider the relation of the real part of —i/(?) to its imaginary part). 

_Note that the kernel of H, _ _ _ 

K(s,l) = —, 


(28) 


fails miserably the tests for boundedness given in theorems 2 and 3. 

The argument used above to show that H is an isometry of L 2 L 2 can be used 
to prove: 

Theorem 8. The Hilbert transform K is a bounded map of L p L p for all p, 

I < p < oo. 

Take p = 4, and consider the analytic function / 4 . By Cauchy’s theorem, 

丰 f 4 d^ = 0. (29) 

We choose the same contour as in (26) and let € 0, i? oo, to obtain 

f ⑽)+卿) 4 处 =0. 

Jr 

The real part of this relation is 

f (h 4 - 6h 2 k 2 + k 4 ) d$ = 0. (290 

Jr 

According to a well-known inequality, for a, b, c positive ab < ca 2 /2 + b 2 /2c\. 
applying this to a = h 2 , b = k 2 ,c = 6, gives 

6h 2 k 2 < 18/2 4 + i/: 4 . 

Selling this into (290, we get 

\ J k 4 d^ <17 J lM. 

This shows that W : L 4 -» L 4 is bounded, and that|//| <34. 












By a change of variable we can write the second integral as 

e~ s, r l f 2 dt= e- l, u-^ 2 du f " 2 = Cs ,， (35) 


"wHere' 


~C = / - e ： rM- a =l/2- c j ir：= ~~l - e r-X 2 x rrJ 2xdX' ='2* T ~ e~~ X ~C 

Jo .Jo Jo 


(36) 


Setting (35) into (34) gives 


lg(^)| 2 < Cs_、’ 2 \-f(t)\ 2 e- s, t } / 2 dt. (37) 

Jo 

Integrating (37) gives 

llgll 2 = 厂 lg ⑷ I 2 心 <C^°°y \f{t)\ 2 e- s, t x l 2 s- l l 2 dtds. (38) 

Interchange the order of integration, and change variables in the ^-integral: 

e - St t” 2 s~” 2 ds= e- li ir ] / 2 du=C. 

Jo Jo 

So we get from (38) that 

M 2 < c 2 ||/|| 2 . 

Using the value of C given by (36), we conclude that ||L|| < To show that 
equality holds, take /(f) = 1 iJt for a < t < b, zero outside this interval; for this 
choice ||/|| 2 = log b/a. Set g = L/; it is not hard to show that as a tends to zero 
and b to oo 9 \\g\\ 2 > TtiY- e) log b/a. Combined with ||L|| < this proves (33). 

^ □ 

Again note that the kerne] of L, e~ st , utterly fails the criteria for L 2 boundedness 
contained in either theorem 2 or theorem 3. 


Exercise 3. Prove that the Laplace transfom L is not bounded as a map of 
ZAR+) — ZAR+), except for p = 2. (Hint: Try / ⑴ = e"~ ar .) 

As remarked in chapter 15, theorem 8, if L is bounded, so is L 2 ; therefore it 
follows by submultiplicity from (33) that 

||L 2 ||<||L|| 2 = 7r. (39) 

We claim that in (39) the sign of equality holds. To see this, we note that the kernel 
of the integral operator L, e~ st 9 is a real symmetric function of s and t. It is easily 






verified (see formulas (6),(60) that an integral operator L with a symmetric kem 
satisfies 


(Lu r u) = (m, Lu). 


(4 






16.4 SOLUTION OPERATORS FOR HYPERBOLIC EQUATIONS 

We recall from-section 5 of chapter -11 the class of symmetric hyperbolic operators 
of first-order. These are first-order partial differential operators of the form 

■ ■ , a __•• ““ . __ L_= A jdj 4- jB, . .. 〜 _ (42) 

........ - j " 

The Aj, B are n x n matrices with real-valued entries that are reasonably smooth 
functions of s. We take them to be periodic in L acts on vector-valued functions 
u{s), whose components are real- valued,- and assumed -reasonably smooth, periodic 







S(T): u(0)^u(T); (48) 

S(T) is called the solution operator. Theorem 12 states that for each T the solution 
operator is bounded in the norm L 2 (F) L 2 (F). 

Proof. Assume first that the matrix K in (45) is > 0 for all s. Take the scalar 
product of (46) with 2u, and integrate over F. Using (43)，we can write 

2{u,ii t ) + 2(u,Lu) = 0 ， 

so by (45 ; ) 

2(«, w/) + (u, Ku) = 0. (49) 

The first term can be written as d{u y u)/dt. Therefore, if the symmetric matrix K > 
0, it follows from (49) that ||m ⑴ || is decreasing as function of 7; from this (47) 
follows for all T > 0, c == 1. 

If K is not positive, introduce v by u = e kt v as new dependent variable; set this 
into (46) to obtain v t -\-(k + L)v = 0. For k large enough, K >0, so v satisfies 
(47) with c = 1 . Therefore u satisfies (47) with c = e kT ， T > 0. □ 

Obviously the proof works also if u is not periodic in s but tends to zero as |j| —• 
oo so fast that u G 


we iaKe now a special example or an equation ox lorm 



Denoting u = (v f u;/, we can write (42) componentwise as 


v t + v x + w y = 0, 

Wt — W X + Vy = 0. 



v “一 Vxx — Uyv = 0, 

也 tt — Wxx — u；y y = 0, 


the classical wave equation. There is an explicit solution formula for the wave equa¬ 
tion that puts the solution .operator. S (rj in_the_form.of_aa.lntegraLopeEatoc-.Xhe_ 
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16.5 SOLUTION OPERATOR FOR THE HEAT EQUATION 


We consider solutions m(jc, t) of the heat equation. . _ ____ — 

u t = iixx (50) 

that are defined for all x and all r > 0, 如 d whic•h_z^J}_•suffiqie|lll>u:apidI>L.asL 二 .• 
Jx | —> oo. _ _ _ _ ___ 

Theorem 13. Let u(x 、 t) be a solution of the heat equation, as above. Then for all 

r >0 


⑴ l“(r)|max ：£ l M (0)|max- * 

m \u(T)\ Li < \U(0)\ L U 

m \u(T)\ L 2 < _) i l2 . 

Remark 2. Since (50) is linear, these estimates show that u is uniquely determined 
by its initial value and that the dependence of u on its initial data is linear. Therefore 
the solution operator 

S(r) : u(0) u(t) (51) 

is well defined. In terms of it, theorem 13 can be formulated so: |S(r)| < 1 as an 
operator mapping L p **-» L p , p = oo, 1,2. 

Proof. Let k be an arbitrary positive number. Define v(x, t) to be 

v = ue- kt .. (52) 

Then u satisfies the equation 

v t +kv = v xx . (50’) 

Since u(x, t) was assumed to — 0 as |jc| oo, the same is true of v{x, t). It 
follows that in the strip 0 < t < T, 一 oo < x < oo the function \v(x, /)| takes on 
its maximum. We claim that this is at a point where t = 0. Say that the maximum 
occurs where f = T. If T) > 0 at this point, then the first term on the left in 
(50’）is > 0, and the second term on the left is > 0, while the term v xx on the right is 
< 0. At a negative minimum we find an analogous contradiction. So it follows that 

max \v(x, t)\ = max|u(x,0)|. 

0</<7,jc a* 

This shows that i; satisfies property (i) of theorem 13. Letting — 0 in the definition 
(52) of v shows that also it satisfies (i): 


|S(r)| < 1, 


S : L 00 L 00 . 




defined for 0 < t < T, and all x, which tends to zero sufficiently rapidly as |.r| -> oo. 
Multiply equation (50) by w, (53) by «, and add; the result can be written as 

(uw) t = vni xx - uw xx . 

Integrate this with respect to x on K; integrate by parts. The fact that v, w tend to 
zero as |,r| -> oo shows that the integral of the right is zero. So we get 


that is, f uw dx = («(f), w(t)) is independent of f, in particular, 

(tt(0), io(0)) = (w(T), w(T)). ( 54 ) 

Denote the inintia] value u(0) by /; in the notation (51), u(T) = S(T)f. Similarly 
denote the final value w(T) by g. Analogously to what we have shown about solu¬ 
tions of (50), for t < T, w(t) is completely determined by w{T), and there is a linear 
relation between w(T) and iu(0) that we denote by S': 

w(0) = S'(T)g. 

We rewrite (54) in this new notation as 

(/.S^r^^CSCD/^). (55) 

The bracket (w, w) is a bilinear function: for fixed iu, it is a linear functional of w and 
for fixed w, a linear functional of w. Thus (55) says that S and S' are transposes of 
each other with respect to this bilinear pairing. 

It is easy to verify that 

l«lt> = . I.(«. «0 丄 _ _ __ 

According to part (i), |S , (r)g|„ m . r < so we deduce from (55) that |S(T) f\ L \ 

S as asserted in part (ii). 

Panrtiii), the boundedness of S : Lr Lr, follows from the Marcel Riesz con- 
vexity theorem， - theorem ， 伞 above; ，， - 二一 —- ，〜一 ，，- • 

REMARK 3. Here is another, direct proof for part (iii). Multiply equation (50) by 2u 
and integrate with respect to .r over R. Integrate by parts on the right. By the fact that 
u(x, 0 0 as |jc| — oo, we get 

i f u2dx = - f u ldx. 
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This shows that / w 2 (x ， t) dx is a decreasing function of r, from which part (iii) 
follows. 

A similar direct proof can be given for part (ii). Let Xj(t) be the points where 
u(x, t) changes sign: 



fox Xj < x < Xj + \ y j even 
foiXj < x-<Xj + \, j odd. 


(56) 


Then-- 


卩 ~+|(0 

Differentiate this with respect tot. Using calculus and equation (50), we get 


u(x ' t)dx - 


(57) 


= Yy -1 ) j f ’ j 1 u xx 

= X^(-l) 々 Mx(A>fl) - Ux(Xj). 


(570 


It follows from (56) that the first a : -derivative of u alternates in sign at the points Xj： 
f , f > 0 for j even 

£ 。 for j odd; 

therefore the right side of (570 is < 0. This shows that \u(t)\ L \ is a decreasing 
function of t, as asserted in part (ii). P 


We give now yet another proof of theorem 13; the initial value problem for (50) 
can be solved explicitly: 


U(x, t) = -^= I f(y)e-^-y^dy. 


This shows that S is an integral operator whose kernel is exp{(x — y) 2 /At]/2^/Ttt. 
We appeal to parts (iii) and (iv) of theorem 1 to prove parts (i) and (ii) of theorem 13, 
and to theorem 3 to prove part (iii). □ 


Theorem 13 holds for second-order parabolic equations in any number of space 
variables; the proofs sketched above apply to the general case, except of course the 
last one based on the explicit formula for the solution. 


16.6 SINGULAR INTEGRAL OPERATORS, PSEUDODIFFERENTIAL 
OPERATORS AND FOURIER INTEGRAL OPERATORS 


The above-named classes of operators play a dominant role in modem analysis, in 
particular, the modern theory of partial differential equations. They extend enor¬ 
mously the class of traditional integral operators; they unify integral and differential 




operators. In particular, inverses of various differential operators can be expressed in 
terms of such operators. 

For the theory of these operators we refer to the literature, in particular, to 
Hormander and Taylor. We call attention to a particularly sharp result concerning the 
L 2 boundedness of pseudodifferential operators due to David and Joumee. 
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BANACH ALG EBRAS-AND 

-T 菊 

SPECTRAL THEORY 


17.1 NORMED ALGEBRAS 

We saw in chapter 15 how to multiply, by composition, two linear maps of Banach 
spaces into other Banach spaces, provided that the target space of the first map is 
the domain space of the second map. In this chapter we specialize to the study of 
bounded, linear maps of a Banach space X into itself. Any two such maps may 

be composed so that the set £(Z, X) of such maps forms an algebra, with a unit 

Each element of C(X,X) has a norm, with the properties embodied in theorem 2 
and theorem 8; that is to say, the norm is subadditive and submultiplicative. Such an 
algebra is called a nonned algebra. 

There is a group of important results about bounded linear maps of a Banach space 
X into itself that depend only on the algebraic and analytic structure of C(X, X). 
In this chapter and in chapter 18, we derive these results in the context of normed 
algebras. 

Definition. A normed algebra is an associative algebra over the complex numbers. 
Each element has a positive norm, |M| = 0 only for M = 0. The norm satisfies the 
usual conditions: 

|M + N| < |M| + |N|, |cM| = |c| |M|，and |NM| < |N| |M|. (1) 

A nonned algebra with a unit is just that, where the norm of the unit is 1: 

|I| = 1 •‘ (2) 

Definition. A normed algebra C that is complete with respect to the norm is called 
a Banach algebra. 


192 









Theorem 1. 

(i) If M and Kin Care invertible, so is their product MK, and 

(MK)- 1 =K-UM- 1 . (6) 

(ii) IfM andK commute, 

MK = KM, (7) 

and if their product is invertible, so are M anclK separately. 

Proof, (i) is an obvious consequence of associativity. To show (ii), denote the 
inverse of MK by N: 

(MK)N = I = N(MK). 

By N and K’s associativity, we conclude that KN is a right inverse of M. By com¬ 
mutativity of M and K and associativity, we get 

I = N(MK) = N(KM) = (NK)M, 

- fcom_whicJi_we-condude.that_NKis a/^uVzve/;ye-of-M.-So-M.js-in-vertibl©v_B 

Theorem 1 is purely algebraic; not so 


Theorem 2. Suppose that K in C is inverti 
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Proof. We treat first the special case K = I; we claim that all elements of the form 
一 B are invertible, provided that - 


The inverse of I — B is given by the geometric series 

- 


⑼ 




Clearly, since |B| <1， the sequence of partial sums is a Gauchy sequence; since £ 

-is-complete,the series-ccmver-ges 7 -It-follows^em 4 H^at-a^eflver^en^ei 4 es-eafl-be- - 

multiplied termwise; multiplying (9') on theJeft by B } we get 

OO 00 

BS = B = S -1, 


from which it follows that (I —B)S = I. Similarly, multiplying (90 by B on the right 
shows that S(I — B) = I. This shows that S is the inverse of I — B. 

We return now to ( 8 ); we factor 


K-A = K(I-K- 】 A). 

Set B = K 一 1 A; by submultiplicativity, and by inequality ( 8 ), 

|B| = |k- 1 a|s|k_ 1 | iai < l. 

Using (90, we invert (10): 

(K - A)- 1 = (i -K-' A)- 1 K- ] ( K-lA ) n 
This proves that (K — A) is invertible. 


K _I . 


( 10 ) 


do 7 ) 


Definition. The resolvent set of M in £ consists of those complex numbers 入 for 
which 


XI-M 

is invertible; the spectrum of M consists of those 入 for which it is not. The resolvent 
set of M is denoted by p(M)，its spectrum by cr(M). 

In chapter 11， section 11.4, we defined the notion of an analytic function of a 
complex variable whose values lie in a Banach space over C. Since a Banach algebra 
is, in particular, a Banach space over C, we may speak of analytic functions whose 







Theorem 3. 


(i) The resolvent set p(M) is an open subset ofC, 

(ii) The resolvent of M, defined, on p(lVI) as (fl — M) 一 ! ，abbreviated as (f — 
M) _l , is an analytic function of ( on p(M). 

Proof. Suppose that X is in p(M); then by theorem 2 applied to K = 入 1 — M and 
A = /ii, 

( 入一 /i)I 一 M=( 入 I 一 M-/zI) 
is invertible for h small enough. This proves part (i). 



oo 

((A - h) - M )— 1 = X! ( 人 — Mf - 1 h n - (11) 

0 

this shows that the resolvent can be expanded in a power series around each point 入 
of p(M), convergent for \h\ < [(A. — M) _ 1 1 一 〖， this proves analyticity, as asserted in 
part (ii). □ 



ofM. Then 


|(A.-M)- I |>rf-!. — (If) 


Theorem 4 (Gelfand). 

"7/J Tiie spectrum cr(M) is a closed, bounded, nonempty set in C. 
(ii) The spectral radius of M, denoted ^Hcrfriiyi,Is — 





cr(M) satisfies \\\ < IM^I^^.In view of definition (12) it follows that |a(M)| < 
since this holds for all integers k, 

|cr(M)| ^liminflM^I 1 ^. (16) 

k—rOO 

We turn now again to representation (13) of the resolvent and express the coeffi¬ 
cient of 厂 /2 一 1 by the Cauchy integral formula: 




(17) 








FUNCTIONAL CALCULUS 


As path of integration we may choose any contour C in the resolvent set of M th 
winds once around cr(M). It follows from the definition (12) that | = |cr(M)| + 
is such a contour. We may then estimate from (17): 

|M n | <c(|ct(M)| + 5) ,i+1 , c= max |(^I-M) _1 |. 


Take the nth root. 


| M ’i|l/n < c l/« (|cr(M)| + 5) 1+1/n 
and then form the lim sup, 

limsup|M ,I | 1/n < |a(M| + S. 
n—)*00 

Since this is true for any 5 > 0, it is true for 5 = 0; 

limsup|M n | 1/n <>(M)|. (1 


Comparing (16) and (18), we conclude that the lim inf and lim sup are 
obtain Gelfamf s formula (12’）for the spectral radius. 


□ 


17.2 FUNCTIONAL CALCULUS 


Since L is an algebra，we can form any polynomial p of an element M of C by setting 
N 

= (19) 

(19) defines a mapping from the algebra of polynomials into the algebra £ that is, 
clearly, a homomorphism. This homomorphism can be extended to a larger class of 
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Definition. IM M be an element of C, f(X ) a funetibn analytic in a domMri G 
containing cr(M). Let C 1 be a contour in G fl yo(M) that winds price around every 
point in cr(M) but winds zero time around any point of the complement of G. We 
define . . • "•- ••— 

/(M) = ^(f-Mr>/(Od-f. (21) 

By the Cauchy-integral J;heorem,-(2.1-).-is-independent-of-the-choice-of4:te-Gontoun —— 
Theorem 5. 


— 一⑴ For f and (I9}are : the same ： - 

(ii) The mapping (21) from the algebra of functions analytic on an open set con¬ 
taining a (M) into C is a homomorphism. 

(ni) 

a(/_ = /(a(M)). (22) 

(iv) Let f be analytic on an open set containing cr(M), and g analytic on an open 
set containing f{a (M)). Denote their composite by h, 

HO = ^(/(?))； (23) 

then 

h(M) = g(f(M)y (230 

Proof, (i) Replacing /(f) by a polynomial in (21) and using formula (17) shows 
that (21) and (19) are the same. The same argument shows that (21) is the same as 

(20) for / analytic in a disk of radius > or|(M)_L . 

(ii) For any pair of complex numbers f and co, 

(fl - M) - (col 一 M) = (f — co)I. 

Suppose that both ( and co belong to p(M). Multiply the above identity by 
(? - M)*" 1 (o> - M)— 1 (( - cor 1 : 

(c -w) _1 [(w-M) -1 -(f -M) -1 ] = (f (24) 


Relation (24) is called the resolvent identity. 

The mapping / — /(M) given by (21) is obviously linear. We show now that 
it is multiplicative. Let /, g both be functions analytic in an open set G D a(M). 
We choose two contours C and D, both in G H p(M) so that they have no point in 
common, and so that D lies inside C. That is, C winds once around every point co 
of D, while D winds zero times around every point f of C. Using definition (21) for 
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f and 尽 with contours C and D, we write /(M)g(M) as a product of two integrals, 
which we express as a double integral. Then we use the resolvent identity (24): 


f(M)g(M) = j- - M)-\co -M)- ] f{ ； )g(co)cT^cTco 

=参 - -M)~ l ]f^)g(co)^dco 

=^ ^ - w)~ 1 /(OJ(^ - M)- 1 g(a>) ctco 


Since C winds once around every point co of D, the integral with respect to f above 
in the first term is, by the Cauchy integral formula, f(co). Since D does not wind 
around any point f on C, the co integration in the second term above is zero; so we 
conclude from (25) that 


fm)g(M) = 丰 ( 0 ) 一 M)- 1 f(co)g(Q})ctco, 

which by (21) is /z(M), where h(co) = f(co)g(co). This proves that the mapping (21) 
is multiplicative. 

(iii) We have to show that \x belongs to the spectrum of /(M) if and only if fx is 
of the form 


M = /( 入)， 入 ina(M). 


(26) 


If ix is not of form (26)，then /(?) - /x does not vanish on o-(M). Therefore 
(/(f) 一 is analytic in an open set containing a(M), thus we may 

define g(M) by formula (21). According to part (ii), [/(M) — /x — I] 容 (M) = /i(M), 
where /i(f) = (/(() — M) §(0 — 1- Thus /i(M) = I, and ^(M) is the inverse of 
/(M) - ill. This proves that \x does not lie in cr(/(M)). 

On the other hand, suppose that 从 i& of form (26). Define the function k(^) by 




/(?) 一 /( 入） 


-GleaFtyr^-te-anatytiG-kHan- ❻ p&n-set-tsontaining-o^M-)-，- so- A(-M)- ean. be-defined by- 
-(2.1).-Since (C 一入 ） ) = /(f) 一 /(A.), it follows from part (ii) that 


(M - 人 I) k(M) == /(M) — /( 入 )L 


(27) 


Since 入 belongs to cr(M), the first factor is not invertible. We appeal now to part (ii) 
of theorem 1, according to which the product /(M) — /( 入 )1 is not invertible either. 

(iv) By assumption, g(co) is analytic on /(a(M)). Since by part (iii) the spectrum 
of /(M) is /(cr(M)), it follows that formula (21) can be applied to g in place of /, 
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and/(M) in .place of M and Z) in place of C : 

: —" — —TO?? 1 脾恥， . . (28) 

For co on D, 一 /(())""】 is an analytic function on a(M); therefore applying 
formula (21) once more, we get - - ___ 

— —_ M )- 1 一:一 

provided that the contour C does not wind around any of the points co on D. Now set 
(29) into (28): 一 _ 

5(/W) =— 丰手 “ -M)-、- f(Or ] gia})d ； dco. (30) 

We reverse the order of integration; since C does not wind around points of D, it 

follows that D winds around every point ( of C. By the Cauchy integral formula, 

手 ( ⑴ - f(Or l 8(<o)eTco = g(fO k(a 

where we have used (23). Setting this in (30) on the right we get, by (21), /i(M), as 
asserted in (230. □ 

Definition (21) and properties listed in theorem 5 are called thefunctional calculus 
for operators. Relation (22) is called the spectral mapping theorem. 

Suppose that the spectrum of M can be decomposed as the union of n pairwise 
disjoint closed components: 

a(M)=a\ U-.-Uo*//, 07 门取 = 0 . (31) 

For each j, denote by Cy a contour in the resolvent of M that winds once around 
each point of aj but not ak r k^ j. We define. 

Pj = 卜 - Mr%. (32) 

Theorem 6. 

(i) The Vj are disjoint projections, that is, 

Pj=Pj and P ； P)t=0 forj^k. (33) 

(ii) 


(iii) P m ^ 0 if a n is not empty. 


E p 尸 1 


(34) 







winds 
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of. Relations (33) are corollaries of part (ii) of theorem 5. Since C = J^Cj 
once around every point of cr(M), (34) follows by summing (32) over all y, 
and using (14). We leave the proof of part (iii) to the diligent reader. □ 

Exercise 1. Show that if P is a nonzero projection, that is, satisfies P 2 = P ^ 0, 
then 

|P| > 1. (35) 

Exercise 2. Show that the spectral radius |cr(M)| depends upper semicontinuously 
on M in the norm topology, namely, that if lim = M, then 

lim suplorCM,,)! < |cr(M)|. 


Exercise 3. Show that | exp M| < exp |M|. 


Exercise 4. Show that if 0 does not belong to the a(M), and if 0 can be connected 
to oo by curve that lies in p(M), then log(M) can be defined so that 

exp log(M) = M. 

Exercise 5. Define Cm to be the closure of the algebra generated by M and (( — 
M) _l , f in p(M). Show that Cm is a commmutative subalgebra of C. 


NOTE. For the history of the spectral theory of operators in a Banach space, see 
pp. 607-609 of Dunford and Schwartz. 

The term “spectrum” is due to Hilbert, a remarkable anticipation of its meaning in 
quantum mechanics. 
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GELFAND^ THEORY 
二 tOECDMMCmim — 
BANACH ALGEBRAS 


A Banach algebra, defined in chapter 17, is an associative, complete normed algebra 
over C. The algebras L we deal with in this chapter are assumed to have a unit, 
denoted as I, with |I| = 1, and to be commmutative: 

MN = NM for all N, M in £• (1) 

The main topic, as in chapter 17, is invertibility, approached here through two 
concepts that turn out to be equivalent: multiplicative functionals dud maximal ideals. 

Definition，A multiplicative functional p in a Banach algebra £ is a homomorphism 
of C into C. l 

Although defined purely algebraically, homomorphisms of a commutative Banach 
algebra with a unit have the following analytic property: 

Theorem 1. Every homomorphism p of a commutative Banach algebra with unit 
into C is a contraction, that is，satisfies 

\P(M)\ < |M|. (2) 

P roo f. Since M = IM for every M，and since p is a homomorphism, 

P(M) = p(IM) = p(I) p(M). 

It follows from this that, unless p = 0, a trivial case, 

P(l) = 1. ⑶ 
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Let K be an invertible element of £, that is, KN = I. Then, by (3), 
p(K)p(N) = ^(KN) = p(I)= i ； 


this proves 

Lemma 2. IfKis invertible, p(K) ^ 0. 


Suppose now that contrary to (2), |p(M)| > |M| for some M; then 


M 

7m 


satisfies 


|B| < 1. 

It follows then from (9), (90 of theorem 2 in chapter 17 that 
K = I-B 


(4) 


(40 


is invertible. On the other hand; by ⑷ and (2 )， 

pi K) = p(I)-p(^j = l-\=0 - 

this contradicts the observation in lemma 2: if K is invertible, p(K) ^ 0. Hence (2) 
holds for all M. □ 


The main result of this chapter is the converse of lemma 2. 


Theorem 3. An element K of a commutative Banach algebra C with a unit is invert¬ 
ible if and only if 


P(K)^0 (5) 


for all homomorphisms p of C into C. 

Proof. As observed already in lemma 2, if K is invertible, then p(K) ^ 0 for all 



therC- lS^3t-hOIT10niOI^hlSITl--- j P.-=S--jC='^^ri(C^-SUGhr.tllclt - r-r :-.-.-.- - z.-zs .-ra«rT-s=~.:?i - arrr-r 

P(K) = 0. (6) 

To construct such a p, we need some algebraic and analytic notions. 


Definition. Let £ be a commutative algebra with unit. A subset T of £ is called an 
ideal if it has these three properties: 
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(i) T is a linear subspace of 

(ii) For any M in C, ^____ 

(iii) ZisnontriYial’meaning tliat jyAneiig^yJQl-JO^^^j ^^K. — 


Note that an ideal cannot contain an invertible element N. For then it would follow 
from (ii) that J contains every element of C, contrary to (iii). In particular, J does 



Lemma 4. Let L and A be commutative algebras, with unit over the same field, q a 
homorphic map of C onto A. Suppose that q . is nontrivial in this sense: 


: H) ~ 句 -js-n&tan-iso mor phis m i - - —: 

(ii) q (£) does not consist of {0} only. * 

The kernel of the homomorphism q, consisting of all K in C mapped into 0 by q, 
is an ideal in C. Conversely, every ideal 1 in L is the kernel of some nontrivial 
homomo rphism. 

Proof. It is easy to see that the kernel of ^ is an ideal. To show the converse, define 
^4 to be 

£ 

A = £(mod 2)=—, 

meaning that A consists of equivalence classes of elements of £, two elements M 
and M, being equivalent mod X if their difference belongs to X: 

M = M’ mod X if M - M' 6 J. 

Addition and multiplication of equivalence classes is performed by picking arbitrary 
representatives of each class, adding or multiplying them, and then forming the class 
to which they belong. 

The mapping q is taken as the natural assignment to each element M of £ the class 
of all M 7 congruent to M mod I. Clearly, the kernel of q is I. □ 

Lemma 4\ C t A and q as in lemma 4 t J an ideal in 人 The inverse image of J is 
an ideal in C. 

Proof is obvious. 

We have noted earlier that an ideal contains no invertible elements. Conversely: 

Lemma 5. Every nonzero element K of C that is not invertible belongs to some 
ideal 


Proof. That ideal is the principal ideal KC generated by K. It is easy to veri^ 
properties (i) and (ii); since KC does not contain the identity I, property (iii) holds 
































GELFAND’S THEORY OF COMMUTATIVE BANACH ALGEBRAS 


207 


Theorem 14. C as above, N any element of C, The spectrum ofN is 

or(N) = _)} (9) 

as p ranges over all homomorphisms of C into C. 

Proof. By definition of spectrum, ； belongs to cr(N) iff (I - N is not invertible. 
According to theorem 3 this is the case iff /?(fl — N) = 0 for some p. Since, by (3 )， 
p(l) = 1, it follows that ( lies in cr(N) iff f = p(N) for some p. □ 

We show next how to use the characterization of the spectrum contained in theo¬ 
rem 14 to give a new proof of the spectral mapping theorem. We recall the functional 
calculus developed in chapter 17; formula (21) there defines /(M) as 

= /(f)^ (10) 

for every / analytic in an open set containing a(M). According to part (iii) of theo¬ 
rem 4 in chapter 17, equation (22), 

(/_) = /(a(M)). (11) 

The integral (10) is the limit in the sense of the norm of the usual partial sums em¬ 
ployed in defining a Riemann integral. It follows that for any bounded linear map 
€:£->€ we may apply i inside the integral on the right in (10): 

Uf(M)) = ^ ((? -M)- 1 ) /( ⑽. （100 

In particular, (10’）holds for every homomorphism p of £ C: 

p(f(M)) = M)- 1 ) /(f) cTf. (12) 

Since /? is a homomorphism, , 

Setting this into (12) gives 

下⑺ M)T=r f 即 厂卜 7(^ 抑: .. ’ — V3) 

It follows from theorem 14 that /?(M) belongs to cr (M). By construction, the contour 
C winds once around every point of cr (M); so by the Cauchy integral formula the 
right side of (13) equals /(p(3VI)). This implies that 

P(/(M)) = /(p(M)). (14) 
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According to theorem 14, as p runs through all homorriorphisms the left side of (14) 
fills up the spectrum of f(M) while the right side fills up /(a(M)).So we conclude 
.that (11).holds. □ 

The homomorphisms p constructed in this chapter can be regarded as functions 
of the associated-maximal -ideal A4 and of the element N of the algebra £: 

_______ —… - p = p(M N). - (15) 

For fixed N, p is a function on the space J of maximal ideals. 

Definition. The functions p defined above constitute tht.Gelfand representation of 
飞 picd~oftts~maximaTideais. 

Theorem 15. 

(i) The Gelfand representation is a homomorphism of C into an algebra of 
complex-valued functions on the set J. 

(ii) The representation is a contraction: N)\ < |N|. 

(iii) The spectrum ofN is the range of the function representing N. 

(iv) The unit I is represented by p(M, I) = L 

fvj The functions p separate points of J; that is, given two distinct maximal ideals 
M. and M!, there is an N such that 

piM^N) ^ p(M\N). 

Proof. Part (i) expresses the fact that each pis a homomorphism; part (ii) restates 
(2), part (iii) restates theorem 14, part (iv) restates (3), and part (v) follows from 
theorem 13. 

Definition. The natural topology on the space J of maximal ideals is the coars¬ 
est one in which all functions p(M, N) s N fixed, are continuous. This is called the 
Gelfand topology. 

Theorem 16. J is compact in the Gelfand topology. 

Proof. Consider the product space 

p = PJDini ， (16) 

C 

where D r denotes the disk | < r in €. Each disk is compact; therefore by Ty- 
chonov’s theorem so is their product, P, in the product topology. By (15), p(M, N) 
lies in the disk D|N|. We map J into P by assigning to each M of J the point 


07 ) 







APPLICATIONS OF 
GEEFAND^THEORY^ 
OF COMMUTATIVE 
B^NACT ALGEBRAS 


19.1 THE ALGEBRA C(S) 

<S is a compact Hausdorff space, C = C(5), the algebra of continuous, complex¬ 
valued functions on <S, normed by the maximum norm: 

|/| = max|/Cs)|. (1) 

s 

Given any point r on S, we can associate with the homomorphism p r : £ C: 

Prif) - f(r). (2) 

As noted in theorem 13 of chapter 18, the kernel of p r is a maximal ideal 

M={/:/(r) = 0}. ⑶ 


Theorem 1. Every maximal ideal M. in C(S) is of form (3). 


Exercise 1. Prove theorem 1. 

Theorem 1 shows that the maximal ideal space of C (S) can be identified with S 
itself; the abstract theory gives nothing new! 

19.2 GELFAND COMPACTIFICATION 

5 is a locally compact Hausdorff space, Cb(S) the algebra of all complex valued, 
bounded ，continuous functions on S, normed by 
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|/| = sup|/(j)|. (4) 

Given any point r in <S, M r defined by (3) is a maximal ideal. We claim that if 5 
is not compact there are others. We demonstrate this when 5 = E ： 

Let {^} be a sequence of points tending to oo. Define 1 to consist of all / such 
that 

n lim o /( J „) = 0. (5) 

Clearly, J is an ideal, and equally clearly, the functions in 1 have no zeros in com¬ 
mon. As observed in chapter 18,1 is contained in some (in fact many) maximal 
ideals M\ yet clearly, M is not of form (3). 

Although theorem 1 fails in the noncompact case, the following is true: 

Theorem 2. The set of maximal ideals M r of form (3) is a dense subset 、 in the 
Gelfand topology, of the space of all maximal ideals of Cb(S) when S is a locally 
compact Hausdorjf space. 

Proof. Let M.qo denote a maximal ideal. Open sets in the Gelfand topology that 
contain Moo contain M that satisfy for some € > 0 

\p(M,hj)- p(Moo,hj)\ <e, 1 < 7 < k, ⑹ 

where hj are elements of the algebra, and p(M) is defined by equation (15) of 
chapter 18. Set hj - fj -f cy,cy = pWoo,we can write (6) in the form 

\p(M,fj)\ <e, p(Mco, fj) = 0, 1 < ； < k. (7) 

We claim that every open set of form (7) contains some M r . For suppose not, then 
for every r inS,M = M r does not satisfy (7). Since by definition (14), chapter 1 8, 
and definition (3) of M r , p(M r , fj) = //(r), violating (7) means that for all r t 

max|/y(r)| > €. ⑻ 

Since by (7 )， p(Mooy fj) = 0, fj belongs to Moo, j = 1,..., /c. Since is 
an ideal, it follows thaffor 础 itrar^ bound^Tc 而 fci 而而 s-fifn^trons. gJT [ 百 j f，/ aiso 
Berongs"t6 yVtoo. Choosing gj = f j ， we conclude that 

/ = Ei/ ； i 2 


( 9 ) 







for every r in S. This shows that / is an invertible element；'and thus cannot belong : 

to an ideal. This contradiction shows that every neighborhoQd-Qf-^M-oe-CQntains-SQme - 

points,.. ■ — 口 

The maximal ideal space of Ct(S) in the Gelfand topology, is called the Gelfand 
compactificadon of S. This space contains a dense subspace homeomorphic to 5; 
the restriction of the space of continuous functions on .the..GelfanclcompaGtification—-— .. 

to this subspace js-the-space^f-aU-boimded_cQntinuoiis_fiiiictions£)iLiS-_ 

The next example is more fun. 


t9.3—ABSOLUTELYXrONVERGENTFODRIER SERIES— —— -- 

£ is the algebra of all complex-valued functions f(6) on the unit circle S 1 that have 
an absolutely convergent Fourier series, namely of the form 

f{Q) = J2c n e ine , (11) 

I/I = (12) 

It is easy to verify that the norm (12) is submultiplicative: \ fg\ < \f\ |g|. So £ is a 
Banach algebra. The function / = 1 is its unit; its norm equals 1. For each point co 
of 5 1 , the mapping 

PM) = / ㈤ ） （ 13) 

is a homomorphism of £ C. Conversely: 

Theorem 3. Every homomorphism into C of the algebra L of functions on 5 1 with 
norm (12) is of form (13). 

Proof. According to theorem 1 of chapter 18, every homomorphism /? of a Banach 
algebra C has norm 1. Since e ie and e_ iG both have norm 1, it follows that 

|p(^)|<l, \p(e- i6 )\<l. (14) 

Since p is a homomorphism, 

p{e w )p(e- ie ) = p{\) = \. (15) 

Combining (14) and (15), we conclude that \p(e ie )\ = 1; therefore we can write 

p (〆)=〆' co real. (16) 

Since p is a homorphism, p{e inQ ) = e tn(iJ for all integers n, and for all finite sums 
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( 17 ) 

Since p is continuous and J2 \ c n\ < oo» (17) holds for infinite sums as well which 
converge in the sense of the norm (12). This proves that p is of form (13). I | 


According to the main result of chapter 18, theorem 3, an element / of a Banach 
algebra L is invertible if /?(/) ^ 0 for all homomorphisms p of L C. In view of 
theorem 3 above we conclude 


Theorem 4. If a function f defined on the unit circle has absolutely convergent 
Fourier series, and doesn’t vanish at any point of S^, then its reciprocal /— 1 also 
has absolutely convergent Fourier series. 

This celebrated theorem, due to Norbert Wiener, is astonishing, for there is no 
obvious relation between the Fourier series of a function / and of its reciprocal. 

The situation is similar in more variables: 

f{9) = ^2,c n e in ' e , 9 = /i = 〜， … ， n 々， 

[|c„i < oo. 

The analogues of theorems 3 and 4 hold, with similar proofs. 


19.4 ANALYTIC FUNCTIONS IN THE CLOSED UNIT DISK 

A is the algebra of all functions f(z) analytic in the open unit disk | 艺 | < I， and 
continuous up to the boundary \z\ = 1. Clearly, 乂 is a Banach algebra under the 
norm 

l/l = max |/(z)|. (18) 


Given any point w in |z| < 1, the mapping 

Pw(f) = fW (19) 

_is a homomorphism of A -^ CJDonversely:_ 

Theorem 5. Every homomorphism of the algebra AC is of form (19). 

Proof. By theorem i, chapter 18, such a homomorphism p has norm < I. Since 
according to (18 )， f(z) = z has norm = 1， it follows that 


\p(z)\ < 1. 


( 20 ) 








?■ 
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Denote the value of p(z) by w\ since p is a homomorphism, p(z n ) = w n and for all 
finite sums 

p = J2 a J wJ - (21) 

We can express (2ij so: when f(z) is a polynomial, /?(/)== /(iy). Since every 

- j£_iiU/4-can-be_approximated uniformly-on |z| < 1 by.polynomials,.and.since-/?, is.. 

continuous, this relation holds for all / in A as asserted in theorem 5. □ 

Exercise 2. Show that every function can be approximated uniformly on the 

unit' pol}m6iTnaIsr — ■ 

Theorem 6 - Let fu...，fmbea collection of m functions in A that have no com¬ 
mon zero in the disk. Then there are functions g \,..., g m in A such that 

sl . ( 22 ) 

Proof. Consider the set X of all functions of form 

f = h J inA - ( 23 ) 

Unless X is all of A, it is an ideal in A, and therefore it is contained in some maximal 
ideal A4. According to theorem 13 of chapter 18, is the null set of a homomor¬ 
phism. By theorem 5, all homomorphisms are of form (19); therefore a maximal 
ideal is a collection of all functions in A that vanish at some point w. Since the func¬ 
tions fj, j = 1. m were assumed to have no common zero, they cannot belong 

to the same ideal. This means that (23) cannot be an ideal, therefore the functions / 
of form (23) are all of A. In particular, / = 1 is of that form; this proves (22). □ 

The situation is similar for analytic functions of k complex variables, defined in 
the polydisk 

n(M < i) 

and continuous up to the boundary. The analogues of theorems 5 and 6 hold, with 
analogous proofs. 


19.5 ANALYTIC FUNCTIONS IN THE OPEN UNIT DISK 

The algebra B of bounded analytic functions in the open unit disk |z| < 1. These 
form a Banach algebra under the norm 

I/I = sup |/(z)|. (24) 

|2|<1 
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Every mapping of form (19) ， |u;| < 1， is a homomorphism: S therefore the 
set of / satisfying 

f(w) = 0, |u;| < 1 (25) 

is a maximal ideal M w - Not all maximal ideals in B are of this form. However，the 
following is true: 

Theorem 7. The set of maximal ideals M w of form (25) is a dense subset, in the 
Gelfand topology, of the space of all maximal ideals. 

The analysis presented in the proof of theorem 2 shows that the theorem is equiv¬ 
alent to the following proposition: 

Theorem 7’. Let f\,... t f m be a collection of functions in B that have the property 
that 

(26) 

for every z in \z\ < 1. Then there exist m functions gj e B such that 

HgjfjEV (27) 

Theorem 7 is called the corona theorem. The implication (26) (27) is subtle 

and deep. It was shown to be true by Lennart Carleson, using function theoretic 
arguments. Tom Wolff succeeded 1979 in giving an entirely different proof, using 
methods of partial differential equations (e.g., see Koosis). 

19.6 WIENER’S TAUBERIAN THEOREM 

The next application deals with a Banach algebra without a unit. This is easily reme¬ 
died by adjoining a unit I in a purely formal fashion. That is, if X is a Banach algebra 
without a unit, the enlarged algebra C consists of elements of the foffTTXr 干 
a complex number, M an element of A. Addition is defined componentwise，and 
multiplication according to the distributive law: 

QH-FNr(XI + M) = + /xM-f A.N + NM ： 

Note.that 乂 is a maximal ideal in C, . 

Norm in the enlarged algebra is defined as follows: 

|AI + M| = |i| + |M|. 

Clearly, this new norm is subadditive and submultiplicative, and the unit I has 
norm 1. 
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We take X to be the space L 1 of complex-valued integrable functions on R, which 
we denote by lowercase letters. We define multiplication to be convolution: 

(/ * 容 )⑷ = / f(s - u) g{u)du. (28) 

JR 

The change of variable of integration s — u == u shows that convolution is. commuta- 



Lemma 8. Convolution is submultiplicative in the L ] -nonn t that is, 

•••"• . ' ._ 兑 \f*8\Q < 1/ln lglL»- _ • 一 : _J29)_ 

P roo f. Suppose that both / and g are continuous and of compact support. Then 
so is / * g ，and it is the L 1 -limit as A 0 of finite sums of the form 

J2f(s-JA)g(jA)A. (290 

J 

Note that the L 1 -norm of a translate of / is the same as the L 1 -norm of/; therefore, 
using the subadditivity of norm, we conclude that the L^norm of (29 ; ) is bounded 
by 

I/Il»E^A)|A. (30) 

We let A tend to zero; the sum in (29’）tends to f*g, and (30) tends to \f\ L i |g| L i ， 
so we obtain in the limit inequality (29). □ 

Actually the proof presented above applies to any norm for the function / that is 

translation invariant, leading to the following important inequality: 


I/*^I<I/IUIli, (31) 

where I | stands for any translation invariant norm for functions on K. From now 
on all norms in this section denote the L^-norm. 

We denote by C the convolution algebra of L】functions augmented formally by 
a unit, which we denote by e. We proceed now to determine all maximal ideals of £， 

or rather, what is equivalent, the multiplicative linear functionals p whose nullspaces 

they are. One of the maximal ideals in the convolution algebra is L ] itself. For any 
other p, there is an / in L 1 such that p(/) # 0; we normalize / so that 

M/) = l. (32) 

Let t be any real number, and denote by f t the translate of /: 




( 33 ) 











Lemma 9. f t depends continuously on t in the sense of the norm, that Ls\ 

/l^O ^ t+h 一力 I = 0 . (34) 

Proof. This is clearly true when / is continuous and has compact support; since 
every f in L l can be approximated in norm by such functions, (34) holds for all /. 

□ 

We define now the function x (0 as 

X(0 = 〆/，). (35) 

Since p is a continuous linear functional, it follows from lemma 9 that x is a con¬ 
tinuous function. Since the norm of f t is independent of t, it follows that x(0 is 
uniformly bounded by |/| for all real t. 

Lett and r be two real numbers; we claim that 

ft+r *f = ft* fr- (36) 

To see this, we apply the definition (28) of convolution to the left side: 

Ut+r*f)(s) = J f(v-t ~r)f(x-v)clv. 

Introducing v — r — u sls new variable of integration transforms this to 

J f(u - 1) f(s - r 一 u) du; 

this is ft * f r , as asserted in (36). 

Now let p act on (36). Since p is multiplicative, we get that 

P (ft+r) p{f) = pift) p(fr). 

Using the definition (35) of x and the normalization (32), we deduce that x satisfies 
the functional equation 

xG + d = X( 0 x(r). (37) 

...We have.already seen—that—x is continuous ancLbo unfied; according to a we ll known_ 

result of analysis, all such solutions of (37) are purely imaginary exponentials: .- 二…一 

X(0 = 〆 &， H real; . - —.( 38 ) - 

Having determined the action of /? on / and its translates, we want to determine 
its action on all g in the algebra. We start by remarking that since p is continuous, and 
p(f) # 0, /? does not vanish in a ball of small enough radius around /. Since / can 
be approximated arbitrarily closely in norm by continuous functions with compact 
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. support, there is such a function for which p does not vmiish. This shows that we 
may take / to be continuous and of compact support. 

Let ^ be any condnuous ftnction of compact support. Then thglum (29’) tends in 
the sense of the-L 1 -norm-to-/-*-g. Using the notation (33), we can rewrite (29') as 

J2fj A gUA)A. . . .. .(391. 

-- - Let p-aet on the-preeeding sum; sinee -/? is4inearrwe-get 

- bining-( - 抵 ) - - 

jy 邮 gu 邮 . (40) 

—~ - -As~A-->~Qr(4Q-} 4SRife4Q . . .... . .:- 

l(^) = f e^ v g(v) dv, (41) 

JR 

that is, to the Fourier transform of g except for the factor 1 /^/n. Since (39) tends to 
f 卑 g, and since p is continuous, it follows that p(f * g) = |(f). Since p is multi¬ 
plicative, and p(f) = 1, it follows that 

P(f ^ 8) = P(g) = 8(^) (42) 

for all continuous g of compact support. Since both p and the Fourier transform 
at ^ are continuous functions of g f it follows that (42) holds for all g in iJ. We 
summarize: 

Theorem 10. Every multiplicative linear functional of the convolution algebra L 1 is 
of the form (42 )， 与 some real number Conversely, for every (42) is a multiplicative 
linear functional. 

Proof. The first part has been demonstrated above; the second part paraphrases 
the well-known fact that the Fourier transform of / * ^ is the ordinary product of the 
Fourier transforms of / and g: 

/ = □ 

Theorem 11 (Wiener). Let f bean function on M whose Fourier transform f( 与 ) 
is nonzero for any Then the translates of f span all ofL', that is, any L' function 
can be approximated in norm by linear combinations of translates of f. 

REMARK 1. The condition of the nonvanishing of / is necessary, for if / vanishes 
at rj, so does the Fourier transform of any translate of /, of their linear combinations 
and L 1 limits of them. 

Proof. When g is continuous and of compact support, / * 客 is the L 1 limit of the 
linear combination (290 of translates of /. It follows from lemma 8 that for any g 
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in L 1 ，/ * g is the L 1 -limit of / * g lu where g n are continuous, of compact support, 
and tend tog in the L l -norm. Therefore, to prove the theorem, it is sufficient to show 
that the space of functions / * g , 尽 in L 1 ， is dense in L 1 • □ 

Lemma 12. Let f be a function as in theorem 11, and m an O function whose 
Fourier transform m has compact support. Then m can be written as 

m = g in L 、 (43) 

Note that the L 1 functions whose Fourier transforms have compact support are 
dense in L 丨 ； therefore theorem 11 follows from lemma 12. 

Exercise 3. Show that the set of L l functions m whose Fourier transform has com¬ 
pact support are dense in L 1 • 

Proof of Lemma 12. Choose a compact interval / so large that it contains the 
support of m. Construct an auxiliary function h in L l whose Fourier transform is 
real and has the following properties: 

^ )= {<l everywhere. (44 ) 
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— It follows from (44) and the nonvanishing of / that (46) is positive for all Thus 

.+/*/ c does not belong to the nullspace of any multiplicative linear functional. 

We claim that e - h + f ^ f c is invertible in £; for according to theorem 3 of 
chapter 18, an element of a Banach algebra is invertible iff it does not belong to the 

nullspace o_f_any ,multiplicative linear functional. 

Denote by (the inverse of e 一 h +-/• * f c : . 

(ie — /厂中 y* * — •- - -- - 

Multiply this relation by m: 

~ : . . . . . .~"*X47)~" 

We claim.that (e — h) * m is zero; To see this regard e as the Dirac 5-distribution. 
The Fourier transform of (e - h) * m is (1 - h)m, see section B.5. According to the 
construction (44), 1 一 /! is zero in I while m is zero on the complement of /. This 
shows that the Fourier transform in the sense of distributions of (e-h)^ m is zero; 
therefore so is (e — h) * m itself. Setting this in (47) gives 

f * fc * d * m = 

this is the desired relation (43), with g = / c * d * m. 口 

We give now an indication how theorem 11 is used in applications. 

Let n be a bounded function on M, which has a limit as s tends to oo: 

^lhn^n^) = a. (48) 

Let / be an L ] function, normalized so that 

J f(u)du = l. (49) 

It follows then easily from (48) and (49) that 

* n)(s) = a. (50) 

The question is: can one deduce (48) from (50)? A result of this kind is called a 
Tauberian theorem. 

Theorem 13. Let n be a bounded function on R, and f an L' function normalized 
as in (49). Suppose that (50) holds, that is, that the convolution off and n tend to a 
number a as s tends to oo. Suppose that the Fourier transform of f is nowhere z^ro. 
Then n(s) tends to a in the mean in the following sense: for every value ofd 

1 f s + d 

J s n{u)du = a. (51) 










5*-ALGEBRAS 


Proof, (50) implies that for any t. 


= a. 

(5(Y) 

Taking linear combinations of (50’)，we deduce that for 



(52) 

* n)(s) = a cy = a J hdu. 

(53) 


Clearly, (53) holds for any function h that is the L 1 -limit of a sequence of Fund ions 
of form (52). 

If the Fourier transform of / is nowhere zero, then it follows from theorem 11 that 
(53) holds for all h in L 1 . Take, in particular, 

.,._ f l/d forO < s < d 
lW ^ [0 elsewhere; 

for this h, (53) becomes (51). 

REMARK 2. (51) is not quite (48) but close to it. If, for example, we know that n is 
uniformly continuous on R+, then (51) implies (48). 

Exercise 4. Suppose that n is slowly increasing in the sense that 

sup n{u)-n{v) (55) 

U — [<V<lt 

tends to 0 as u tends to oo. Show that then (51) and (55) imply (48). 

NOTE. Wiener showed howto use his Tauberian theorem to prove the prime number 
theorem. 

Exercise 5. Let / be a function_of_class_ L 2 on R. Show that the translates of f span 
L 2 if and only if / does not vanish on a set of positive Lebesgue measure:— 


(54) 

匚 I 
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Hilbert space.// into itself, its conjugate -transpose, called in this context its adjoint, 
- —~i^anotheroperator- A t ：- ^-^ - 廿 s othat the-pairsatisfies 

(Ajc,y)^(x,A*y) (56) 

for all a:, y in //. Every bounded operator A has an adjoint"A Jtc ； "the"following alge-" 
•. -^kaicjBi^r^sfonowdire^ —: 


A* + B *， (/:Ar = kA*, ~ 一 

A**=A， （AB)* = B*A*. (57) 




Theorem 14. For a bounded, linear mapping Ar. H — H, H a Hilbert space, 


liA|| = ||A*|| ? (58) 

HA*A|| - ||A|| 2 . (59) 

Proof. Take the supremum of the absolute value of each side of (56) for all x, y 
of unit length: ||x|| = ||j|| = 1. On the left take the supremum first with respect to 
: y，then with respect to x; the result is ||A||. Reverse the order on the right, obtaining 
|| A* II; this proves (58). 一 

To prove (59), take y = Ax in (56); estimating the right hand by the Schwarz 

inequality gives 


IIAx || 2 < ||x|| ||A*Ax || < || a -|| 2 ||A*A||. 

Taking the supremum over all unit vectors a- gives ||A|| 2 < ||A*A||. Since by sub¬ 
multiplicity and (58) ， ||A*A|| < ||A*|| ||A|| = ||A|| 2 , (59) follows. □ 

Definition. A complete normed algebra with a * operation that has properties (57), 
(58), and (59) is called a S*-algebra. 


An element A of a B*-algebra is called self adjoint if A* = A; B is called anti- 
self-adjoint if B* = 一 B. 


Theorem 15. 


(0 The spectrum of a self-adjoint A of a commutative -algebra is real. 

⑼ The spectrum of an anti-self adjoint element B is imaginary. 

P^oof. According to Gelfand’s theory the spectrum of A is the set of numbers 
/?(A)，/? being any multiplicative linear functional. We claim that for A self-adjoint, 
p(A) is real; to see this, write 


p(A) = a -f ib. 


(60) 
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Let t be any real number; set T = A + /rl. Then T* = A — itl, and 

T*T = A 2 + f 2 I. (61) 

From (60 )， p(T) =a + i(b + t), so 

\p(T)\ 2 = a 2 + (b + t) 2 . (62) 

According to theorem 1 of chapter 18, every multiplicative linear functional is a 
contraction: 

\pm\ 2 < IITII 2 . (63) 

Using (62) on the left，and (59) and (61) on the right, we deduce from (63) that 
a 2 + (^ + f) 2 <l|T*T||<i|A|| 2 + ? 2 . 


If b ^ 0, this is clearly false for t large and of the same sign as b. So b = 0; this 
proves part (i) of theorem 15. 

If B is anti-self-adjoint, it follows from (57) that iB is self-adjoint. Thus part (ii) 
follows from part (i). □ 


Theorem 16. Every multiplicative linear functional p on a commutative B 年 -algebra 
satisfies 


P (T*)=jm. 


(64) 


Proof. Define A and B as 

T + T* -T — T* . 
A= ~2~' = 

it follows from (57) that A* = A ， B* = t*B. Decompose T and T* as 
T = A + B, T* = A - B. 


(65) 


Since p is linear. 


p(T) =./? (A)“+-/? (B); - -p (: p (A')- —〆 B) (66 )， -- : 

According to theorem 15, p(A) is real and p{B) imaginary; therefore (64) follows 
from (66). 


Theorem 17. For evety T in a commutative B* algebra,- 
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Proof. We first prove (67) for the self-adjoint elements A of the algebra. When 
A* = A, (59) becomes 

II A 2 1| = ||A|| 2 . (68) 

MoJJows from (57) that if A is self-adjoint, so is any power of A; therefore it follows 
from (68) that ... 

_[|A—1|-= ||A-.||?-... .. 

Applying this to n = 1 ， 2,4, 8 , ... ， 2 々 =m and combining these identities, we 
deduce that 

l|A /n || = ||A|r 

Taking the /nth root, we conclude that 

lim ||A m || 1/m = ||A||, m = 2 k . (69) 

k-rOO 

According to theorem 4 of chapter 17, the limit on the left of (69) is equal to the 
spectral radius of A; this proves (67) for T = A self-adjoint. 

We turn now to arbitrary T; it follows from (64) that for any p, 

p (T*T 卜 p(T*)p(T) = |/?(T)| 2 . (70) 

Since every point in o-(T) is of the form /?(T), it follows from (70) that the spectral 
radius of T*T is the square of the spectral radius of T: 

|a(T*T)| = |a(T)| 2 . (71) 

By (57) ， T*T is self-adjoint; since we have already proved (67) in the symmetric 
case, 


||T*T|| = |a(T*T)I. (72) 

Combining this with (71) gives 

l|T*T|| = |or(T)| 2 

By (59) ， ||T*T|| = ||T|| 2 ; this yields (67) for all T. □ 

NOTE. Gelfand invented his theory of commutative Banach algebras to derive the re¬ 
sults presented in section 7. The application to Wiener’s theorem was an afterthought. 

HISTORICAL NOTE. Tauberian theorems were named so by Hardy and Littlewood 
after the obscure Austrian mathematician Alfred Tauber, 1866-1942, who wrote the 
first paper on the subject in 1897. His major contributions were to actuarial science. 
In 1942 he was deported to the Theresienstadt concentration camp. 
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EXAMPLES OF-OPERATORS 


In this chapter we discuss and illustrate phenomena in the spectral theory of operators 
that go beyond the mere fact that operators belong to the Banach algebra of bounded 
linear maps C(X 、 X); we exploit the fact that these are operators that act on elements 
of a Banach space. 


20.1 INVERTIBLE MAPS 


First we address the operation of invertibility of a map M in C(X, X). By definition. 
Mis invertible iff it maps X onto X in a one-to-one fashion. The inverse is necessar¬ 
ily bounded, thanks to the closed graph theorem; see theorem 11 of chapter 15. Thus 
there are only two ways for M to fail to be invertible: 

(a) M is not one-to-one. 

(b) M is not onto,. 

It follows that if the product MK of two maps is invertible, then K is one-to-one, 
and M is onto. If M and K commute, meaning that MK = KM, we deduce from 
invertibility of the product that both M and K are both one-to-one and onto, and 
therefore both M and K are invertible. 

We have seen in theorem 2, chapter 17, that if an element K of a Banach algebra 
is invertible, then so are all nearby elements K — A, |A| < const. For maps we have 
the following additional result: 

Theorem 1. X a Banach space, K : X — X a bounded linear map that maps X 
onto itself. Then all nearby maps, namely those of the form 

K 一 A, |A| < with € small enough, 


also maps X onto itself. 


⑴ 
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This implies that there is a depending on f, such that (f - M) 一 】 w = 义 is large 

compared to it: … • 


(f -M)A- = «, \\x[> 2i^TX[ |M| - ⑺ 

Nowin theorenrlnake-K^= 

-of Krc -Ar—MHrall-of-A^dieirwexarrtaket-^A So close to 入 -that |Af = i ■ 入 - | i s —- 
small enough for (K — A)x = u to have a solution x whose norm satisfies (3). This 
is in direct contradiction to relation (7), which, asserts that the unique solution of 

——: -~~ --- - ~ 

has a norm |^| that is very large compared to |w|. □ 

Theorem 3. Let M be a bounded linear map: Z Z, M 7 •• X’ — X’ its transpose. 
Then 

cr(M f ) = a(M). (8) 

Proof. The proof is based on this simple observation: 

K : X X is invertible iff its transpose K’ ： X’ — X’ is invertible. Now, if K is 
invertible, it has an inverse L: 

KL = LK = I. (9) 

The transpose of these relations is 


LK，= KV = I 7 , 


(9 7 ) 














20.2 SHIFTS 


Take X to be t 2 y consisting of vectors x with complex components: 

jc = (卯，“!， … .）(10) 
The right shift R and left shift L are defined by 

Rx = (0，flo, ai，• ， • ， ）， Lx = (a!， 的， •. •）. (11) 

Clearly, LR = I, RL ^ I, so neither L nor R is invertible. A moment’s calculation 
shows that R and L are transposes of each other: 

R，= L，R = I/， (12) 

Theorem 4. The spectrum of R and of L consists of the unit disk [A.1 < L_ 









ietermine the eigenvalues and eigenvectors of L. Suppose thai 
1 ( 11 ) this means that {a \, ai, 幻 ， •. .）= X(aQ, • • •)» wh 


Dngs to \ a n I 2 < oo\ this is satisfied by~(14)*iff |X| ~< 
eigenvalues of L— 虹 e—all complex - numbers 言， •"|勾 _ ^: - 1 ：7yi~Big 
CL); since the spectrum is closed, all X.in the unit disk belong to o*(L). 

_ cr(L)= 

.(取 this and the previous two statements theorem 4 follows. " □ 


Exercise 2. Show that R has no eigenvalues. 


Exercise 3. Show that the spectrum of R and L acting on the spaces, l < p 
oo, consists of all points of the unit disk. 


20.3 VOLTERRA INTEGRAL OPERATORS 
Take X to be C[0,1], and V to be integration: 

(Vx)(s)= f S x(r)dr. (15) 

JO 



Proof. The /i-fold iterate of V is given by the formula 

(V n A-)(^) = f\s- r) n ~ l x(r)dr, (1 ： 

(n - 1)1 Jo 

as may be verified by induction, using integration by parts. For any s in [0,1] 

|V^-(i)| < [\s-r) n \x\dr<^. 

(n - 1)! Jo n\ 

So for any x in C[0,1], [V^jcI < \x\/n\, which implies, by definition of norm, t\ 
|V| /l < \/n\. It follows from this that 

i 

=0， 


and so by theorem 4 of chapter 17, the spectral radius of V is zero. Since the spec 
trum is nonempty, theorem 5 follows. i 
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Example L X = i 2 ; {入沒} a given bounded sequence of complex numbers. For x 
given by (10)，define 

1VL\: = (入 oao, 人 Pi ， ... ， KiCi n ) • • • • (16) 

Exercise 4. Prove that the spectrum of M as defined by (16) is the closure of the set 

{入《}. 

Exercise 5. Take X to be 1 < p < oo, and define M by (16). Prove that cr(M) 
is the closure of the set {A n }. 

Exercise 6. Suppose that the kernel of the integral operator 

Kf(x)a = [ K(^, 0 f{t) dt (17) 

Jo 

is a continuous function of j ，f in f < s. 

(a) Show that K maps C[0, 1] into C[0, 1]. 

(b) Show that the spectrum of K consists of the single point 0. 

Operators of form (17) are called Volterra operators, after Vito Volterra, who first 
investigated their theory. 


20.4 THE FOURIER TRANSFORM 


Denote the Fourier transform by F: 


(F/)(«) = -^= J f{x)e ixil dx = /(«)• 


We take it as known from the theory of the Fourier transform, see Appendix B, that 
F is an invertible, norm-preserving map of L 2 (R) onto L 2 (K). The inverse is given 



= -J= J f{u)e~ ixu dll. 

,Replacing,x.by-X gives 


f(-x) = ^J f{u)e ixu du. 


Denote the mapping f(x) f{-x) by R, it follows from the formula above that 
-F^-R-Since^R 2 ^ I, it follows that 


F 4 = I. 
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It follows then from the spectral mapping theorem that the spectrum of F lies on the 
set consisting of the fourth roots of 1 : ± 1, ± i. 

Exercise 7. 

™" (a) Show'that F maps the space of functions of the form p(x)e^ x2 ^ 2 , p a polyno¬ 
mial of degree < n, into itself. 

— (•b)~Show*tha1rPiias'-eigenftmctions of form . . . .—- 

⑹ Show that the eigenfunctions in (b) span all of L 2 (3R). (Hint: See section 9.1). 
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COMPACT MAPS 


The notion of compact maps, and their properties, are the bread and butter of func¬ 
tional analysis. 

We recall that a subset 5 of a complete metric space is called precompact if its 
closure is compact. The following are useful criteria for precompactness: 

(a) 5 is precompact if and only if every sequence of points of S contains a Cauchy 
subsequence. 

(b) S is precompact iff for every € > 0 it can be covered by a finite number of 
balls of radius e. 

We turn to precompact subsets of Banach spaces. The following are easily 
deduced from (a) or (b). 

(c) If Ci and C 2 are precompact subsets of a Banach space X, then Ci + C 2 is 
precompact. 

(d) If C is a precompact set in a Banach space, so is its convex hull. 

(e) If C is a precompact subset of a Banach space X, M a linear, bounded map of 
X into another Banach space 17, then M C is a precompact subset of U. 

Exercise 1. Prove statements (c), (d) and (e). 


21.1 BASIC PROPERTIES OF COMPACT MAPS 

Definition. X and U denote Banach spaces. A linear map C : X U is called 
compact if the image C 5 of the unit ball 5 in X is precompact in U. 

Theorem 1. 

(i) The sum of two compact maps: X U is compact. 

(ii) The scalar multiple of a compact map is compact _ 

(iii) Let V be a Banach space，M : U V a bounded linear map, C，• X — U 


compact 
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COMPACT MAPS 


"v ； ( 丄 Z denote a Banach space, N : Z ★ X a bounded linear map, C : X — U 
compact. Then CN : Z U is compact. 

: (v) Let C n : X U be a sequence of compact maps that converge uniformly 
to C, 


_ — . limJC,, ~ C| = 0. (1) 

Xhen,Cds,compact.-. 


Proof. Denote by Ci, C 2 two compact linear maps of X — t/; that means that 
the images = C\ and Co5 = Co of the unit ball B in X are precompact. 
According to (c) above, C\ +C2 is then precompact. Since (C 】 + C2) B is contained 


- 丑邱扣 we~a~subset-ofa'preeompact'set-is~precompact ， -part*<i)-foliowsv- 

Part (ii) is a special case of (iii). Part (iii) in turn follows from property (e) above 
of precompact sets. 

(iv) Since the bounded map N carries the unit ball of Z into some ball in X, CN B 


is precompact. 


(v) Given any € > 0, choose n so large that |C" 一 C| < €• Since C n is a compact 
map,. C n B can be covered by a finite number of balls of radius 6 ； but then CB is 
covered by balls of radius 2e around the same centers. □ 


Suppose that U = X\ in the language of algebra，theorem 1 says that the compact 
maps form a closed two-sided ideal in C(X). According to a theorem of Calkin, this 
is the only closed two-sided ideal when X is a Hilbert space. 


Theorem 2. X and U are Banach spaces, C : X U a compact linear map. LetY 
be a closed subspace ofX, and V the closure in U of CY. 

(i) The restriction of C to Y V is a compact map. 

(ii) Suppose that U = X, and the closed subspace Y is invariant under C, namely 
is mapped into itself by C. Then C : X/Y X/Y is compact. 

Proof. Part (i) is utterly obvious; so is part (ii), once the definitions of the norms 
in the quotient spaces are put in place. 

Exercise 2. Prove that a degenerate bounded linear map D(dim Rd < 00 ) is com¬ 
pact. 

In the rest of this chapter we present a theory of compact operators due to F. Riesz. 
We start by restating lemma 7 from chapter 5, on the geometry of normed linear 
spaces. This lemma will be used over and over again: 


Lemma 3. X is a normed linear space, Y a closed linear subspace of X properly 
contained in X. Then there is an x in X such that 

|x| = l and d(x,Y)= inf \x-y\> L 
y in Y 


⑵ 








theorems are the basic results concerning these maps. 


Theorem 4. Let C be ci compact map of a Banach space X X; denote by I the 
identity map X X; set 

T = I — C. (3) 

(i) The nullspace Nj of T is finite-dimensional, 

(ii) Denote by Nj the nullspace of T J , 

Nj = N tJ . ⑷ 

There is an integer i such that 

Nk = Ni for k > i. (5) 

(iii) The range Rj of T is closed. 

•Prcw/ (i) By definition (3) of T, y lies in Nj if y = Cy. Since C is assumed 
compact, it follows that the unit ball in A^r is precompact. But then according to the 
result quoted above, Nj is finite-dimensional. 

(ii) Assume, on the contrary, that (5) fails for all i, that is, that N[-\ is a proper 
subset of Ni for all i. By lemma 3, there would be for every i a vector yi such that 

yi in iV/, \yi\ = 1, d(y h 昂一 D > 士 (6) 

Take m < n;by definition (3) of T ， 

Cy n -Cy m -Ty n - y m +Ty m . (7) 

The last three terms on the right belong to N n -\, so, by (6)，their sum differs from 
y n by 5 at least. This proves that |C ： y,i — C;y m | > 圣 . Clearly，the sequence (C 














SOF COMPACT MAPS 


We show that then codim Rj = 0, which means that Rj ： is the whole space X. Now 
suppose, on the contrary, that Rj == Xj is a proper subspace of X. Then, since by 
assumption (15), Tis one-to-one, it follows that T X\ = X 2 is a proper subspace of 
X\. Define Xk as X. We deduce similarly that X D X\ D X 2 D • • •, and that all 
inclusions are proper. 

According to part (iii) of theorem 4, X\, the range of T, is closed. We claim that 
every subspace Xk is closed. Indeed Xk is the range of T^ t and = (I - C)^ = 
( 一 1 V (y)^ is, according to theorem 1, of the form I plus a compact operator. 
Therefore we can conclude from theorem 4 that Xk is closed. We appeal now to 
lemma 3; we can choose in Xk so that 

\x/c\ = 1, dist (xic, (16) 

Let /71 and n be two distinct indices, m < n. Then, using the definition (3) of T, 

Cx m — Cx n = x m — Tx m — x n + T X /2 • 








the range of T in A/" . It follows that the dimension of the space of vectors of form 
(20) is •• _ • _ : _ 

dim N — dim Nj：r . . . . (21) 

Combining this with (19) shows that the codimension of Rj in X is equal to dim Nt ：， 
as asserted in (14). □ 


21.2 THE SPECTRAL THEORY OF COMPACT MAPS 

Theorem 6 (F. Riesz). X denotes a Banach space, C a compact linear map of 

(i) The spectrum of C consists of an at most denumerable set of complex numbers 
{X 7l } that accumulate only at 0. If dim X = oo, 0 belongs to cr(C). 

(ii) Each nonzero Xj is a point eigenvalue of C, of finite multiplicity; that is, for 
each X = kj 

the nullspace ofC — k is finite-dimensional, 

there is an integer i such that the nullspace c?/(C 一 X) k is the same as the 
nullspace of(C — X) 1 for all k > i. 

(Hi) The resolvent (f — C) 一 1 has a pole at each nonzero Xj. 

Proof. Define T for f ^0, asT = I-f -1 C. It follows from theorem 5 that if the 
nullspace of T is trivial, meaning that dim A^x = 0, then the range of T is all of X. 
This shows that every nonzero point of the spectrum of T is an eigenvalue. It follows 
from part (ii) of theorem 4 that the multiplicty of an eigenvalue is finite; this proves 
part (ii) of theorem 6. 

We turn to (i). To show that the eigenvalues X n of C can accumulate only at 0, 
consider an infinite sequence {A. w }, X n ^ X m for n ^ m, of eigenvalues, with corre¬ 
sponding eigenvectors x n : 

( 22 ) 

Define Y n to be the linear space spanned byjcj, Since eigenvectors pertaining 

to distinct eigenvalues are linearly independent, Y n ^.\ is a proper subset of Y n . We 
apply now lemma 3, with X = Y n and Y = Y n ^\\ there is a y n in Y n such that 

l^i I = 1 ， l)’n - yl > 2 for all y in Y n -i ， (23)> 

By definition of Y 1U y n is of the form 

n 


yn = 







Thus 


Cy n - X n y n = y^(A/ — X n )ajXj e 


This shows that for n > m, 

Cy n - Cy m = X n y n 一 ; y ， y in y /2 -i. 

So by (23 )， 

\Cy n -Cy m \> 1 ^： 

Since each y n is a unit vector, and since C is assumed to map the 
precompact set, there can be only a finite number of 入 /2 with \X n \ > 
We can restate (25) in a quantitative form. Recall the definition 

function C(e. K) of a Drecomoact set AT in a metric snace： if is thh m 


wc wm soive cms equation ror u in two stages, moose z so large mat 7V/+i = 
where Ni is and denote N( as /Y. Since N is an invariant subspace of C, C 

be interpreted ns ma p ■ _ _ 

.C \X/N —> X/N. (28) 

According to part (ii) of theorem 2, C in (28) is compact. We claim that X belongs 
to the resolvent set of C over X/N, If it did not, X would, by part (ii), belong to the 
point spectrum of C over X/N; that means that some point j in X, ^ 0 mod N, 
would be mapped by C into N. But such a y would belong to N{ + \ and not to jY /， 
contrary to our choice of i. It follows then，since C is compact over X/N, that 入一 C 









is invertible on X/N. Since the collection of invertible maps is open, it follows that 
_f_==_Cis4iiy*eitihIe-for4f~=~X|-sufficiently-sniall，and that 

"1(?-C)- 1 ^ const. (29) 

on X/N for all such ^. '•- - • - - - 

_The first,step in solving (27) is, to, solve the.co ngDje.nce …… - —. 


(v — Cz; = a - mod TV; ~~(30y 

according to (29), (30) has a unique solution v, and |u| < const. \x\. The congruence 

............_ZZZ3—^_ : _ 


— Cv = x — n, n € N. (31) 

The second step is to find a solution z in of the equation 

— Cz = n. (32) 

Adding (32) to (31), we obtain « = u + z as the solution of (27). 

Solving (32) for z in // is a problem in linear algebra. By definition of X as an 
eigenvalue of C，we know that 入 belongs to the spectrum of C over N. Since N is 
finite dimensional, it follows that no ( sufficiently close to A. and ^ X belongs to 
the spectrum of C over N. Thus (32) has a unique solution for such f ， and since 
the resolvent over a finite-dimensional space is a rational function, that solution z 
satisfies \z\ < const. — 入 |—’|«|. It follows from (31) that |n| < const \x\, so 
|z| < const. — 入 I 一 f | 文 |. Combining this with |u| < const. \x\, we get 

|m| = 1 (? - = |u + z| < |u| + Ul < const. If - 入 「 ’Vl. 

This inequality can be expressed so: for ( near enough to X but _ 入， 

i(?-cr 1 i<const.K-xr f . 

In words, near an eigenvalue 入 of a compact map C, the resolvent blows up at most 
like the minus zth power of the distance to X. From this it is easy to deduce, as in 
classical function theory, that the resolvent of C has a pole of order i. This completes 
the proof of theorem 6. □ 

Note that the proof of part (iii) gives another proof of part (ii). 

Exercise 4. Show that the resolvent of C has a Laurent expansion around 入 of the 
form 


—/ 


( 33 ) 









invertible. 


In what follows, X denotes a Banach space, and B a bounded linear operator: 
X X such that for some integer n 

B ；I =C (35) 

is a compact operator. The basic properties of compact operators, theorems 4, 5, 
and 6, are true for operators whose power is compact. 

Theorem 4\ Let B : X X be a map with compact power; set 

S = I-B. (36) 

(i) The nullspace Ns of S is finite-climensional 

(ii) There is an integer i such that 

N S k = N S i for all k > L (37) 

(iii) The range Rs of S is closed. 

Theorem 5’. For S as defined in (36). 

indS = dim N$ — codim R$ = 0. (38) 

Theorem B : X X a map with compact power. 

(i) The spectrum of B consists of an at most denumerable set of complex numbers 
[P n ] that accumulate only at 0. If dim X = oo, 0 belongs to cr(B). 




From this we deduce that 

N T DNs ，_，T C 及 s，—.—— （ 41) 

By assumption B 71 = C is compact; it follows from theorems 4 and 5 that 

. dim i^T < co, , cpdim.gT < PQ-.. . ( 4 _?I 

XSiSKmni1 ； 4 _ 2)H(4inT6T16ws that dim Ns <*oo, codim Rs < ooTByifi^" 
rem 4, Rj is closed; from this and (41) it follows that so is Rs> 


Raising (40) to the kth power gives 



from which we deduce that 


Nj^k D N^k , Rj^k C Rgk ， (43) 

By assumption, B ,2 = C is compact, by part (ii) of theorem 4, N, is independent 
of kiork > i.lt follows then from (42) that the nullspaces N S k ， = 1 ， 2, … are all 
contained in the finite-dimensional space N T i. From this it follows that the spaces 
N S k, ordered by inclusion, are independent of k for k large enough. This completes 
the proof of theorem 4’. 口 

We turn now to theorem &. According to the spectral mapping theorem (see the¬ 
orem 5 of chapter 17)，and since C = B 71 , 

a(C) = a(B n )=cr(B)\ (44) 


merable set of points accumulating only at 0. It follows from (44) that the same is 
true of the spectrum of B; this proves (i). „ … 

Exercise 6. Estimate the number of eigenvalues P of B that satisfy \P\ > € in terms 
of the capacity C(€,B n B). 

We have already proved part (ii) of theorem 6 ’； we leave part (iii) as the next 
exercise. 

Exercise 7. Prove part (iii) of theorem 6’. □ 

We turn now to theorem 5 f . If the operator Q defined in (39) is invertible, then 
it follows from the factorizations (40) that the nullspaces of T and S have the same 
dimension，and their ranges have the same codimension. Thus ind T = ind S; since 
according to theorem 5, ind T is zero, ind S = 0 also, as asserted by theorem 5\ 

To ascertain that Q is invertible, we note that Q is a polynomial in B, and we 
appeal to the spectral mapping theorem, theorem 4 of chapter 17. According to it. 
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ists of complex numbers of the form I + 办 H - h 办"，卢 in 

allows that if a(B) contains no Azth root ol r unity other than 

(B) does contain nth roots of unirv. m'rtnrli 










( 45 ) 











Proof. The defining relation between T and its transpose is 

(T^, t) = (x, Ti). . — 

It follows that the nullspace of T ; is the annihilator of the range of T. 

(i) Since according to theorem 4, Ry is closed, it follows from theorem 8 of chapter 
8 that every vector u that satisfies (w，■£) = 0 for all ■fin 灭士 belongs to.R^ • 一 一一 ― .... 

..(iO.We.saw .in chapter 8, exercise 2,„that. fox a_clo.s 喊 _ 

lator of R is isomorphic with the dual of X//?. Therefore dim 沢丄 = dimCZ//?)'= 
AimX/R = codim R. Apply this to R = Rj. Since the annihilator of Rj is the 
nullspace of T ’， dim Njt = codim Rj. Since, by theorem 5, indx = 0, we deduce 
-that-codirn-i?T ^= dim^x. It follows~tot~dim t 

the Fredholm alternative. - □ 

Theorem 9. A compact map C : X — U maps eve)y weakly convergent sequence 
into one that converges strongly. 

Exercise 10. 

(i) Prove theorem 9. 

(ii) What about its converse? 

NOTE. F. Riesz’s paper was, and is, fundamental for the theory of compact oper¬ 
ators in a Banach space. There he took Hilbert’s definition of a compact operator, 
called in those days completely continuous, and showed how to extend this notion 
to Banach spaces. His accomplishment is all the more remarkable, since 1918 pre¬ 
dates Banach’s fundamental paper by a good five years! And unlike Banach, he treats 
normed linear spaces over the complex field, which is essential for spectral theory. 

HISTORICAL NOTE. Julius Schauder (1899-1943) was the most brilliant of the Pol¬ 
ish mathematicians of fiis time, Schauder bases, the Schauder fixed point theorem, 
the Leray-Schauder degree of a mapping, as well as many fundamental results in the 
theory of elliptic and hyperbolic partial differential equations, are his creation. Be¬ 
ing Jewish, he was killed during the Nazi occupation of Poland. Such things were so 
routine, nobody knows when or where. 
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EXAMPLES OF 
COMPACT OPERATORS 


22.1 COMPACTNESS CRITERIA 

We start by giving some useful criteria in various topologies for sets of functions to 
be compact. The first theorem is the famous Arzela-Ascoli criterion. 

Definition. S is a Hausdorff space; a collection {g} of complex-valued functions g 
on S is called equicontinuous if for every point s of S and every € > 0 there is an 
open set N containing s such that for every r in N and every g of the collection 

I 茗 ( 厂）-洲 I < ^ (1) 

Theorem 1. S is a compact Hausdotff space, {^} a collection of complex-valued 
functions on S satisfying these conditions: 

(i) The collection {^} is equicontinuous. 

(ii) {g、is uniformly bounded: 

< M ( 1 ’) 

for all s in S and every g in the collection. 

nonu.m.S.. _ 

For a proof, we refer to any text on real variables, such as that by Royden. 

Exercise L Show that conditions (i) and (ii) of theorem 1 are necessary for a collec¬ 
tion to be precompact in the maximum norm. 

Exercise 2. Show that if Q is an open bounded set in such that any two points of 
Q can be connected by a path of length < U then a family of functions [g] on Q is 
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precompact in the maximum norm if the functions g and their first partial derivatives 
are uniformly bounded in Q: 

|g| < M, |3/^|<M in 2. 

- Exercise 3. Formulate a version of theorem 1 for functions whose values lie in a 
metric spaced 

Another equally—or even more — important compactness criterion is due to 
Rellich: 

- The 沉 eml 7 Q.is .a~domain in 管 toperrand-bounded，whose boundary-is smooth. 

Suppose that {u} is a collection of functions in Q such that the functions u and their 
first derivatives are uniformly bounded in the l?iQ) norm: 

||m|| < M, P/m|| < M, / = ⑵ 

Then the collection {«} is precompact in the L 2 (Q) noun. 

Proof of Rellich’s criterion is based on Poincare's inequality: for any smooth func¬ 
tion u defined over a smoothly bounded domain O, 

\u(x)\ 2 dx <-\f U — _ + d 2 J2 J q 尚《 1 2 心， ( 3 ) 

where d is the diameter of O. For details we refer to Courant-Hilbert. 

We remark that Poincare's inequality, and likewise Rellich’s criterion, do not hold 
for arbitrary bounded domains. 

22.2 INTEGRAL OPERATORS 

We turn now to integral operators, see equation (1)，chapter 16, where we have de¬ 
fined g = K/ by 

g{s) = j T K(s,t)f(t)dn(t), ⑷ 

s € S，f € T，both compact metric spaces. We will investigate conditions on the 
kernel K(s, t) that make K compact in various topologies. 

Theorem 3. The integral operator (4) is compact as a map — C if the kernel 
K (.v, t) is a continuous function of s and r. 

Proof. To show that K a compact operator with respect to these norms, we have 
lo verify that the functions 


g = K/， |/| L « < 1, 


(5) 







satisfy the compactness criteria (1 )，（ 1’). Since S and Tare compact spaces, the kernel 

is uniformly bounded，so by theorem 1 of chapter 16, K is bounded; thus condition 

(10 is satisfied. To verify (1 )， we study g(r) — g(s): 

\g(r) ~ g(s)\ = |J [K{rj) - K(s y t)]f(t)cln^ < sup \K(r, t ) -尺 Cy J)| |/| L) . 

⑹ 

Since T x S is compact, the kernel is uniformly continuous, so the right side of (6) 
tends to zero as r tends to j*. This shows that the functions g in (5) are equicontinuous. 

□ 

Using an analogous argument we can prove 

Theorem 3’. The integral operator (4) is compact as a map C 4 C if the kernel 
K (s, t) is a continuous function ofs in the L 1 norm with respect to t. 

Next we recall theorem 2 of chapter 16, which says that an integral operator K 
whose kernel K is square integrable is a bounded mapping: l? -> L 2 , and that 

l|K|| 2 < JJ \K{s,t)\ 2 dmdn. ⑺ 

Condition (7) implies more: 

Theorem 4. An integral operator with a square integrable kernel is a compact map 
of I? ^ L 2 . 

Proof. Let [uj] be an orthonormal basis of L 2 (5, dm). We expand K(s, t) for 
fixed t into a series 

t) (8) 

Using the Parseval relation for [uj] gives for almost all values of 4 .. . 

(9) 


Integrating (9) gives _ 二 ___ _ _ 

ffw(s, \Kj(t)\ 2 dn{t). (io) 

Now define 

K N (s,t) = Kj{t)uj{s), (11) 

j<N 


J l^(J, t)\ 2 dm(j) = ^|^(0| 2 . 
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22.3 THE INVERSE OF ELLIPTIC PARTIAL 
DIFFERENTIAL OPERATORS 

We turn now to a class of operators that are defined by solving differential equations. 
Let Q.bea bounded domain in with a smooth boundary; denote by A the Laplace 
operator. It is well known，see section 7.2, that the boundary value problem 

△w = /inQ, t/ = Oon BQ, (17) 

has a unique solution u for every / in C°°(2). 

Definition. Denote the solution u of (17) as 

u =： Sf. (18) 

Theorem 5. S is a compact map of L 2 (Q) into L 2 (Q). 

Proof. Multiply (17) by w, and integrate over Q t integrating by parts: 

-j Q J2\dju\ 2 ds = j^fuds. (19) 

As noted in lemma 1 of chapter 7, for all functions it that vanish on 32 ， 

II«llo <^i|w|ll, (20) 

where \\u\\q denotes the L 2 -norm of u over Q, \\u\\f the sum of the squares of the 
L 2 -norras of the first derivatives of u and d the diameter of Q. Using the Schwarz 
inequality on the right of (19)，and then (20)，we get 

ll«lli <ll/llollM|lo<rf|t/l!oll«lli, 

SO 

ll«lll <rf||/llo. ll«llo < d 2 ll/llo. (21) 

The image of the unit ball in L 2 (Q) under S consists of solutions u of (17) corre- 
- sp.oBding.to-/-with.||/J|o^.lJt-follow.s-from-(2L)4hatthese^atis^fy_ 

ll«lll < d, ||£i||o <rf 2 . (21 f ) 

According to theorem 2, a set of functions satisfying (21’）is precompact in the 
L 2 (Q) norm. This proves that S is a compact operator. □ 


is true even when Q is not smoothly bounded. 
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Theorem 5 can be extended with practically no change to all second order elliptic 
operators with Dirichlet boundary condition. Under the Neumann boundary condi- 
.tion u n = 0 it is necessaiy_to require. 2 to be smoothly bounded. 


22.4 OPERATORS DEFINED BY PARABOLie EQUATIONS —— 

_Our next example concerns solutions of parabolic equ ations of the form__ 

u t = Aw (22) 

:: = :: ::z f 0 dBmgJioxT S u(srt);:s e ~Q _ as ~ in - th 叮 r evio usixampterand~f-^r9dEHs-well-knewii ； - 

and will be proved in the chapter on semigroups, that the initial boundary value 
problem for the parabolic equation has a unique solution; that is, there is a unique 
solution of (23) satisfying 

«(<? ， 0) given in Q, u = 0 on 3 O for all r > 0. (23') 

We denote by S(T) the operator relating u(s, 0) to u(s, T), ? > 0: 

S(D ： m(^0)— T). (23 7/ ) 

Theorem 6. S(T) : LHQjmHQYWdtompact operator for T > d 

Proof. Multiply equation (23) by u, integrate with respect to ^ over Q, with re¬ 
spect to t from 0 to T. We get, after integration by parts, that 


■ J u 2 ds\Q = J j uAudsdt =— ff J2^J u \ 2dsdt 


This proves that 


J u 2 (s, T)ds < J u 2 {s, 0) ds. 


(24) 


namely that ||w (r)||o is a decreasing function of t. In terms of the solution operator S 
defined in (23^) this ca：n be expressed as follows: 


l|S(r)|| <1. (25) 

Next multiply (23) by tAu and integrate with respect to s over Q and with respect to 
t from 0 to T. We get 


j j t u t Au dsdt = j j t(Au) 2 ds dt. 


( 26 ) 







ALMOST ORTHOGONAL BASES 


251 


Integrate by parts on the left with respect to x and using the abbreviation itj for 
the partial derivatives of «, we get the following expression for the left side of (26): 

-f jjl u Y T ^ ds+ \jj Yl u ) dtds - 

Since the right side of (26) is positive, we deduce that 

\ / E u ) (r ) ds ^\fJ 12 u2 j ds dt - 

We can use (24) to estimate the right side above; we obtain 

T J Yl, u Y T ^ ds j u 2 (0)ds. 

Using Rellich’s compactness criterion, theorem 2, and exercise 6, we conclude that 
for r > 0, the image of / m 2 (0) ds < l under S(T) is precompact. 口 

22.5 ALMOST ORTHOGONAL BASES 

The following result due to Paley and Wiener shows that a not too large perturbation 
of an orthonormal basis is a basis: 

Theorem 7. Let H denote a Hilbert space, and {jc /2 } an orthonormal basis of H. Let 
{y/i} ci collection of elements that doesn’t differ too much from {x rt } in the sense that 

~yn\\ 2 < (27) 

Wefurther assume that the [y n ] are linearly independent in the sense that no y n lies 
in the closed linear span of the other y^. 

Assertion. The {^} form a basis in the sense that every u in H can be written 



Proof, (Birkhoff-Rota, and Sz. Nagy) Since the fx,,} are an orthonormal basis, 
every u in H can be expanded as 

- 一 _ 〜 : 一 ■ 叫 r== (» ， x n ) t and ||w|| 2 = ^K| 2 . (28) 

Define the linear map B as follows: for u given by (28 )， 

Bu = Y^ a nyn- (29) 

The series on the right converges; for write Bn -u = J2 a n(yn - Then, by the 
triangle inequality and Schwarz’s inequality, 






P«-m|| < J^ c \an\ llyn -^nll < (yi l^»l 2 ) 1/2 (J2 Ib^-^ll 2 ) 172 < ll^ll c onst 

… - …… .• ：一二-调 

where in the last step we have used assumption (27)，and (28). This proves that B — I 
is a bounded linear map. We claim that B — I is a compact map; for write 

- -Bwnf 仰 十 R w •広 -@ 妁 - 

0 N+\ 



It follows from (32) and (27) that 

Jim IM=0. 

八 ’ 一 ^oo 

This and the decomposition (31) show that B — I is the uniform limit of G^. The 
mapping are degenerate, therefore compact; so B — I is the uniform limit of 
compact maps. According to theorem 2 of chapter 21, B — I itself is compact. 

The nullspace of B is trivial; for if Bm were = 0 for some u # 0, by (29) 0 = 
5Z a nyn would give a nontrivial linear relation among the y n , which is excluded by 
assumption (ii). We apply now theorem 5 of chapter 21 to B = I+(B—I) to conclude 
that the range of B is all of H, as asserted in theorem 7. □ 
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POSITIVE COMPACT 
OPERATORS 


23.1 THE SPECTRUM OF COMPACT POSITIVE OPERATORS 

A classic and important result of linear algebra, due to Perron (e.g., see Lax, p. 196) 
asserts that a matrix with all positive entries has a positive eigenvalue that is the 
largest in absolute value among all eigenvalues. In this chapter we present an infinite¬ 
dimensional generalization. 

Theorem 1. Let Q be a compact Hausdorff space, X = C(Q), K a linear map 
C(Q) C(Q), mapping real-valued functions into real-valued functions. We as¬ 
sume that 

(0 K is strictly positive in the sense that if p is any nonnegative function on Q, 
p 笋 0, then Kp is positive on Q. 

(ii) K is compact 

We claim that then 

K has a positive eigenvalue a of multiplicity one and index one, with positive 
eigenfunction/ ' 

All other eigenvalues \x ofK are smaller in absolute value than a: 

(t)- 

…. .. Proof. The strict positivity of the operator K implies that K is strictly monotone, 
that is, 

if x <y and x then Kx < Ky. (2) 

We consider now the action of K on nonnegative functions. We consider the set 
of all positive numbers k such that there is a nonnegative function x such that at all 
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Its of Q ： 
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operator 


^z{q) = (Ku)(g). 

5 , and since u < y, we deduce from (9) that 
> = (Ku)(q) < (Ky)(q). 


(K 

















a contradiction, caused by assuming that in (8) the sign < holds at point. It follows 
—— that-in-(~8)~eqaatky-heMs~e¥e iywh e FeT We showed-earlier t h 扯出 e-eigenftffletkm- 
z of K with eigenvalue X whose,absolute-value4s4Gr4K)Ms-positwe-Avhen= multiplied 
by a constant, and that A. is real and positive. This proves part (ii) of theorem 1. 

We claim that a has multiplicity 1. For if not, K would have two linearly inde¬ 
pendent eigenfunctions. According to what was shown above, both of these can be 












256 POSITIVE COMPACT OPERATORS 

mass; it is related to Kby 

(Kx ， m) = (jc ， K , 讲 ). "(1^" 

It follows from (16) that K 7 too is strictly positive; that is, that if m is a nonnegative 

measure 卢 0, then K’m is_ a positive measure. K’ and K have the same spectrum. 

Therefore K’ has the same dominant eigenvalue a as K. Since K is compact, so 
is K/. The eigen measure m associated with the eigenvalue a_can be shown to be 
positive by the same argument as used above. 

We are ready to show that K has no generalized eigenfunction with eigenvalue a , 
for suppose that u; were such generalized eigenfunction: 

(K 一 a)w = u, (K — cr)v = 0. 

From (16) we deduce that 

((K — a)w, m) — (w, (K r - a)m) 

Since (K — a)w = v and (K 7 — a)m = 0, we deduce from (17) that 

(u, m) = 0. 

But since u is a positive function and m a positive measure, this is impossible. This 
completes the proof of theorem 1. 口 


Tnr" 
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23.2 STOCHASTIC INTEGRAL OPERATORS 


We consider now an application of theorem 1. 

Theorem 2. Let K(s 、 t) be continuous positive function onO < s,t < l, satisfying 

[ l K(s, t)ds = l for all t. (18) 

JO 

Denote by K the integral operator with kernel K; then 


(i) 1 is an eigenvalue ofK, with corresponding eigenfunction y that is positive. 

(ii) x is any function satisfying 


J x(t)dt = 1. 


(19) 


Then 


lim K n x = >> 
n-^oc 


(20) 
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with respect to Using (18) gives, after changing the order of integration, 
f u(s) ds = f f K(s 1 t)z(t)dtds - f z(t) dt = 0. 


Clearly, the spectrum of K over Z ^ a point spectrum consisting of the spectrum 
of K over X, with the eigenvalue a. .= 1 removed, since a = 1 has multiplicity 1, 
and the corresponding 
part of (ii), theorem 1, 
so the spectral radius rj of K over Z, abbreviated as K y , satisfies 

According to formula (12 7 ) in theorem 4 of chapter 17-，".-. 

In particular, for any 2 in Z, . . ■ —~ ~ — 


(24) 


•么 
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We apply this now to z ： = x - y. Since x satisfies (19), and 3 ； is nonnalized by (21 )， 
z satisfies (23) and so belongs to Z. So (24) applies: 

.Since..)Lis_an.eigenfiinction, y.^.Ky = - - *. == K n y; this proves (17). □ 

which K n x tends to y depends on the size of the second 
largest 吨挪疵 ^ ■师 XTI 丽面而 fes of ffie - 兑 cdhai 秘 sf eig^i\^lue are 
derived in Lawler and Sokal. 

A nonnegative function x(t) that satisfies (.19) can be interpreted as a probability 
— 旣 tend ■ 尺 (‘M) is—t_he__pmbability—density .of Jxansidoa_Erom_state-t-to 

state Relation (18) means that with probability 1 an occupant of state t makes a 
transition to some states. The operator K then models a random process that changes 
a random variable whose distribution has density a* into one whose distribution has 
density Kx. The eigenfunction y normalized by (21) is a probability density that is 
invariant under the random process described by K. 

Given a probability distribution x, K n x represents the probability distribution of 
the system after it has been subjected n times to the random process K. The prob- 
abilistic meaning of the limiting relation (17) is that the n-fold application of the 
random process K turns, as n 00 , any distribution x into the invariant distribu¬ 
tion y. 


23.3 INVERSE OF A SECOND ORDER ELLIPTIC OPERATOR 

Theorem 3. Let h be a differential operator of form 

L — -A + ^6/3 / +c (25) 

. actin S °n pen^^ 3/ = d/ds h A = b it and c 

smooth periodic functions ofs、c positive. Then for any smooth periodic function /, 
the equation 

L d : ' … . (26) 

ha ^ ^ unique smooth periodic solution u. We denote this solution as Kf = u. The 
operator K regarded as a mapping of the space C of continuous, periodic functions 
into C has the following properties: 

(i) K is a bounded map. 

(ii) Kis strictly positive. 

(iii) K is compact. 

Proof. We sketch those parts of the proof that do not require technicalities. Let 
*%ax and ^min denote the points where the function u takes on its maximum and min- 












imum values, respectively. At such points the first derivatives of u are zero, and I 
second derivatives are nonpositive, respectively nonnegative. Therefore we concli 
from equations (25) and (26) that 


^(^max)Wmax £ 
c ( s mm) u m\n i /(^min)* 




COnst.|/| m ax ， 〜 
in (i). 

It follows from (27) that if / is positive for all then so is w. We omit the addi¬ 
tional argument to prove what is claimed in part (ii): if / is nonnegative but ^ 0, u 
is positive. 

We forgo a proof of part (iii), except to remind the reader that an analogous but 
weaker result — the compactness of K in the L 2 -norm — was proved in theorem 5 of 
chapter 22. □ 
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FREDHOLM，S THEORY OF 
INTEGRAL.EQUATIONS—— 二 : 


The historically first general theory dealing with the solution of linear equations in 
infinite-dimensional spaces is due to Ivar Fredholm in the year 1900. Its importance 
was immediately recognized, and it spurred a great deal of further work by Hilbert, 
Schmidt, F. Riesz, Banach, and many others. These newer theories, set in what was 
formalized as Hilbert and Banach space, have completely replaced Fredholm’s the¬ 
ory. Unlike the new, abstract theories, Fredholm dealt with integral operators, and his 
central notion was the determinant associated with such operators. Since this deter¬ 
minant appears in some modem theories (inverse scattering, completely integrable 
systems), it is time to resurrect it. 


24.1 THE FREDHOLM DETERMINANT AND 
THE FREDHOLM RESOLVENT 


u(x) + f K{x.y)u{y)dy ^ f{x). (1) 

JO 

where / is a given continuous function on the interval [0,1] and u an unknown func¬ 
tion to be determined. The kernel K(x, y) is assumed here to be continuous; Fred¬ 
holm^ treatment allowed some singularities. 

The question of when equation (1) can be solved can be settled by theorem 5 of 
chapter 21. To put equation (1) in that context, we will regard both functions / and w 
as elements of the space of continuous functions C[0 ， 1]. As shown in chapter 22, the 

integral operator on the left of (1) is a compact operator. According to theorem 5 of 

chapter 21， the dimension of the nullspace of the operator acting on u on the left side 
of (1) equals the codimension of its range; by theorem 8, the range is characterized as 
those / that are orthogonal to the nullspace of the transpose of the operator, namely 
with kernel K\x,y) = K{y,x). This result was proved by Fredholm for operators 
of form (1) and is known to this day as the Fredholm alternative. 
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Fredholm’s approach was to replace the integral in (1) by a Riemann sum over 



determinants as n oo. 


The discretized form of (1) is 

ill +hY^Kijiij = fi, z_ = 1 ， … ， n ， (Y) 

where fi = f(ih), h = \/n and Kij = K{ih, jh). Denote by D(h) the determinant 
of the matrix acting on the vector u in (10: 

D(/i) = det(/ + / 2 ^ 7 ). (2) 

Clearly, D(h) is a polynomial in h: 

D(h) = Y i a m h m . (20 

0 

The coefficients a m can be determined conveniently as Taylor coefficients: 

a,n = h {ih) Dih)lh =°- (2，,) 

To differentiate a determinant, we use the rule 

去 det(Ci, ,., 1 C n ) = ^det (C" …， , C ;I ^ . ⑶ 

We use the same rule to find the mth derivative. The resulting formula is simple 
because each column Cj is a linear function of h. A further simplification is possible 
because at h = 0, Cy(0) = the yth unit vector. So using (2) in (2"), we get an 
expression involving determinants of principal minors of Kij: 

-D{h)-= 1 - 1-/1 ^ ^• / + 昏吞 det (g, f ") + .... (4) 

We now set h = l/n and let n tend to 00 . The ^th term in (4), a sum with respect to 
k parameters， tends to^MQi^ntegL^-lo mife 
introduced the following convenient abbreviation: 

二 ) =det/a w )， i.uu. ⑶ 

The formal limit as /i 00 of the finite sum (4 )，h = l/n, is the infinite series 


% 
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Defini 
in (1). 


D is called the Fredholm determinant of the operator acting on the left 


Lemma 1. The series (6) converges. 

Proof. To prove convergence, Fredholm relied on an inequality due to Hadamard 
for determinants to estimate the terms of the series (6): 

|det(Ci ， ...,Q.)l<niI C iH> ⑺ 

where \\C\\ denotes the Euclidean tength_of the vector C. The geometric interpreta¬ 
tion of (7) is that the volume of ttie parallelopiped whose vertices are sums of vectors 
Cj is < the volume of the rectangular parallelopiped whose sides Cj have the same 
lengths as the Cy ； 

lie} 11 = ||C 7 ||, 

where the C) are orthogonal to each other. 

Exercise 1. Furnish an analytic proof of inequality (7). 

The kernel K, being continuous, is bounded: 

\K(x,y)\<M for all X ，），， 

so the length of each column vector of the k x k matrix (5) is < M-Jk. Therefore, 
according to (7) 

^( v !' ^)1 ⑻ 

The kth term in series (6) is then less M k k k ^ 2 /k\, which, by Stirling’s formula, is 
< {Me) k k^ k ! 2 . This estimate shows that the series (6) converges. □ 

Fredholm showed howto get a better estimate than (8) in the case where the kernel 
K satisfies a Holder condition in the variable y; 

|^(^, } 0-^(^z)l<M|)^z| a , (9) 

In the matrix on the right in (5) we subtract the (f + l)st column from the fth, i = 
1 ， ...，fe — 1. This new matrix has the same determinant as the original one, and its 
ith column has length < - yi\ a Vk. It follows from (7) that 


< k k/1 M k -wl)' 
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By the arithmetic-geometric mean inequality 

so we deduce that 

( 内 (1/2) ， A (8,) 

The same inequality holds if AT(Ar,j) satisfies a Holder condition in the variable x. 

We return now to equations (1’). To solve such a system of equations, we multiply 
(l 7 ) by the inverse of the matrix acting on the unknowns. The elements of the inverse 
matrix can be represented as determinants of the minors of co-order 1 (i.e., one less 
than the order of the matrix to be inverted) divided by the determinant of the matrix. 
We apply this procedure to the system (1’）and obtain a formula analogous to (2), 
which can be brought to a form analogous to (4). Fredholm, by passing to the limit 
h — 0, has determined the continuous analogue/? of these determinants. Using the 
notation (5), we can write R in the form 
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Proof. Relations (12) and (12’）mean in operator language that 
R -f KR - DK = 0, 

R + RK - DK = 0. (15) 

Since D is assumed ^ 0, we can rewrite these as 

(I + K)(I-£> _1 R) = I, 

(I-ZT^RKI + Kbl. (150 

□ 

The converse is also true. 

Theorem 3. Let K be a continuous kernel such that D = 0; then the operator I + K 
has a nontrivial null-space and so is not invertible. 

Proof. In R(x t y) fix the value of y, and denote the resulting function of x by r: 

= /*(•)• 

Then equation (12) can be written as follows: 

r + Kr = 0; 

this shows that r belongs to the nullspace of I + K. The flaw in this argument is that 
R(x, y) could be zero for all x and y, so that r is the zero function. So we must argue 
differently. 

Let X denote a complex parameter. Replacing AT by 入尺 in formula (6) gives a 
power series in 入 ， which we denote as 

_ = //••_/. 七:::二) _ 

Similarly we define the - function ^(xT7TA ： )'by-repraiJing^ .——• ■ 

邮’⑼ = …心 1 .為 • （17) 

hemm^4^B^and=R (: x—y^yore^entire^analytic^^ 
for jail complex values ofL 

Exercise 2. Prove lemma 4. 

If we set -A：~ Hn-(-l 6) - and- (-l-T)rwe-get-back £)-and 普 cbfined-in-(6) and-(-10)rSor 
if D = 0 ， D(k) has a zero at X = 1. Since D(X) is analytic and not s 0, this zero is 
of finite order. 
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Theorem 6 shows that D(X) plays the role of the characteristic polynomial for 
the operator K. The algebraic multiplicity of an eigenvalue k of an operator K is 
the dimension of the space of its generalized eigenvectors, that is，the union of the 
nullspaces of (/d — K) f ， / any positive integer. According to theorem 6 of chapter 21 ， 
the geometric multiplicity of a nonzero eigenvalue oF a compact operator is finite. 

Theorem 7. Let k be a nonzero eigenvalue of the operator K defined in (13). Ac¬ 
cording to theorem 6, X - -\/k is a zero of the function D ( 入 ） defined in (16). 
Denote by m the multiplicity of this zero. Claim: m is the algebraic multiplicity of 
the eigenvalue k. 

Proof. The proof is the same as that in the finite-dimensional case, emplying mi¬ 
nors of cofinite order of the determinant D. We omit the details. 口 

The next result is a generalization to operators of two important matrix relations. 

Theorem 8. Denote by ic\ ， K 2 , … the eigenvalues of an integral operator K whose 
kernel K(x,y) is Holder continuous in x or y with Holder exponent > Then 

f K(x ， x)dx = Y^ K j 、 ( 20 ) 

D = (20 7 ) 

The series and the product converge absolutely. 


The quantity on the left in (20) is called the trace of the integral operator K; (20) 
is called the trace formula. In chapter 30 we will establish such a trace formula for a 
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Lemma 10. If the kernel K satisfies a Holder condition with exponent a, with re¬ 
spect to~eitherJhe.x or the y variable, then D(X) satisfies the growth condition 

- |D(A)| < const, (exp |A.| 2 / (1+2q:) ). 

Exercise 4. Prove lemma -10 ， using4he definition^ 16).of D(X) and inequality (8’). 

Clearly, for a > \ the function (A) is of order less than 1, and so can be factored 
as in (22). Using—the connection established in theorem 6 betweerTffie zeros of Z) and 
the eigenvalues Kj of K, we can write this factorization as 

二 : 二 -bJCjH ___( 22 . 0 - 

Here we have made use of the fact, evident from (16)，that Z)(0) = 1. We differen¬ 
tiate (22 ; ) and set 入 = 0. On the left side in (22 f ) we obtain dD (X)|x=o ? which 
according to the power series expansion (16) is equal to / K(x,x) dx. The infinite 
product on the right is for \X\ < 1 the uniform limit of finite products; therefore 
its derivative is the limit of the derivative of the finite products. This shows that the 
derivative of the right side of (22 , ) at X = 0 equals Kj. This proves (20). Setting 
A. = 1 in (22’）is identity (20 ; ). This completes the proof of theorem 8. □ 


24.2 THE MULTIPLICATIVE PROPERTY OF 
THE FREDHOLM DETERMINANT 

Next we present Fredholm’s extension of the multiplicative property of determinants 
to operators. We will denote the determinant of I + K by Dk, of I + H by D 尺 ， and 
so on. Similarly we will denote the inverse of I + K by I — D^R^：, and the kernel 
ofRjc as Rjc(x,y). 

Theorem 11. Let JEl and K be integral operators with continuous kernels, and set 
(I + H)(I + K) =I + L. Then 


d l = d h d k . 


(23) 


Proof. We present Fredholm’s beautiful argument based on computing the varia¬ 
tion of Df(. We define 

= 麥 Dk+€Sk\€=0 (24) 

d€ 

To compute SDk as function of SK, we first calculate the variation of the deter¬ 
minant (5). Using formula (3) for differentiating a determinant^ we get a sum of k 
determinants: 


SK ( Xl ’…’ Xk ) = YdttK e 


( 25 ) 
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Now deform K and H into zero so that D 尺 ^ 0, D// #0 during this deformation. 
This is easily done by setting K(t) = \{t)K y H[t) = \(t)H, where the complex- 
valued function X{t) avoids zeros of both D^{\) and D"( 入 ). Formula (31) shows 
that 


去 [log /)。 ⑴- = 0. 

Since L(0) = K(0) = H(0) = 0 and Dq =7, we deduce that log D^—Iog D^Dn = 
0; this proves the multiplicative property of the determinant when Dk 辞0 , D" ^ 0. 
When Dh = 0, 1 + H is not onto, and when Dk = 0, 1 + K is not one-to-one. In 
either case (I + H)(I + K) is not invertible, so Di = 0. Thus Di = DhD^ in ail 
cases. □ 

Exercise 5. Justify the calculation of 8Dk termwise in the infinite series defining 
Dk. 


24.3 THE GELFAND-LEVITAN-MARCHENKO EQUATION 
AND DYSON’S FORMULA 


So far we have taken the integration in the integral equation (1) over the unit interval. 
It could of course be any interval [a, b], even an infinite interval [a, oo], provided that 
the kernel 尺 tends to zero fast enough as its arguments tend to oo. 

Exercise 6. Show that if 


where M(x) is a decreasing function and integrable, 
roo 

/ M(x)dx < co, 

then the series (6) and (10) defining the Fredholm determinants converge. - 


Let K(x, v) be a continuous kernel defined for all real x and y; suppose that as 
.r, y tends to oo, the kernel tends to zero with sufficient rapidity. For any real a we 

-- define-the-FFedhQlm^pdr^QF-I-4 , -K Z 7- r -whQFe--K Tr -i-s-the--integral ope-mtog-e-veF-tfie - 

interval (a, oo) with kernel K: . 


(K fl w)Ct)= / K(\\y)u(y) dy y a < x. 


(32) 


Denote by D(a) the Fredholm determinant of I + K a . We will study how D(a) 

depends on a; we assume that D{d) ^ 0. - - - 

Let h be any real number; the simple transformation y y + h carries the 
operator K fl+ / Z into the integral operator with kernel K(x 4* h, y + h) over thejir^/ 
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Ky(z, y)R(y,z)dz- DK y (y,y) = 0. 


or the variation of log £> and set SK — K x + K y : 


<X ) + K v (x,x)]dx-D~ l // /?(}-,a:)[^(a-, j) 


⑷ =/ [A ： a -(a-,x) + K y {x,x)]dx+ / KAx,x)]dx 


[D- i R y {y,y)-K y (y,y)]dy 


[R x (x,x) + R v (x,x)]dx 




variation of log D(a) is its derivative with respect to a. 
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where D(a) is the Fredholm determinant of the C3-L-M operator acting on w on the 
left in (39). . .... . 
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INVARIANT SUBSPACES 


Let X denote a Banach space, M a bounded, linear map: X X.'Wt recall from 
linear algebra the notion of an invariant subspace: it is a subspace y of X that is 
mapped into itself under M. Y is called nontrivial if it is not the whole space and 
consists of more than the zero vector. In the context of normed linear spaces we are 
interested in closed invariant subspaces Y under M. 

Given a closed invariant subspace Y of M, there is the possibility of inverting M 
in two stages. The task is, given u in X, to find an xq in X such that JVLro = u. First 
we solve this equation mod Y, that is, find an x\ that satisfies Mx\ = u( mod Y)\ 
then z — ii - Mjq belongs to Y. Put xq =： x\ + y, and choose y in K so that it 
satisfies My = 2 . The first step amounts to inverting M on the quotient space X/Y; 
this task can be handled adequately by analytical tools only if X/Y has a normed 
structure, and that is possible only if 7 is a closed subspace of X, 

It follows from the algorithm presented above that if M is invertible on X/Y and 
on Y, then it is invertible on X. The converse is false, as shown in chapter 2. 

Exercise L Prove that if M : Z — X is invertible, and Y a finite-dimensional 
subspace of X invariant under M, then M is invertible on Y and X/Y. 

Exercise 2. Show that the subspaces of X invariant under an operator M form a 

lattice in the sense that the intersection of two invariant subspaces is an invariant 

supspd.cerand-the-elosttr^of-the-siim~ of two i n variant subspace-is an invariant sub¬ 
space. 


Of 1 TW A T>T A TVTnr OTTnCPA iQTTC f~\T? /^r^T\/fPA T V/f A Ttf 1 

u. i l jl ivrrt：jrcr 


Of course every subspace spanned by eigenvectors of M is invariant under M, but 
there are plenty of operators without any eigenvalues ： As we will show below, 
many of these possess nontrivial invariant subspaces. The following result is due to 
von Neumann for Hilbert space, and to Aronszajn and Smith for Banach spaces. The 
- pfoof-given-below-is-due-to-tomonosovr-with-a-simplification due to Hilden. 
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Theorem 1. A compact map C of a complex Banach space X of dimension greater 
than 1 into itself has a nontrivial closed invariant subspace. 

Proof. Suppose that C ^ 0; normalize C so that 

■*— —— -* -… • - • . |C| = 1 . ( 3 ) 

Since C ^ 0, we can choose jcq in X so that 


|Cx 0 | > 1 , \x 0 \ > 1. ( 2 ) 


B = {x\\x-X 0 \<l}. ⑶ 

It follows from (2) that the origin does not belong to B. Denote by K the closure of 
the image of B under C: 

K = CB. (4) 

Since C is a compact map, ^ is a compact set. It follows from (2) and (1) that K 
does not contain the origin. 

We will prove the existence of an invariant subspace by arguing indirectly: as¬ 
sume that C has no nontrivial closed invariant subspace. Now for any y ^ 0 the set 
{/?(C)v} s where p is any polynomial, is a linear subspace of X invariant under C. 
The closure of this subspace is a closed invariant subspace of C, and since C is not 
supposed to have any nontrivial ones, this one must be trivial, meaning all of X. So it 
follows from our supposition that for any y ^ 0, the set {p(C)j}，any polynomial, 
is dense in X. 

Since we have seen before that K does not contain 0, we can apply the above to 
any y of K. In particular, for any y in K there is a polynomial p such that 

|/7(C)y~A* 0 | < 1 . (5) 

Since (5) is a strict inequality, the set of y that satisfies (5) with given p is an open set 
O p . Such open sets cover K\ since K is compact, a finite collection suffices. Thus 
there is a finite collection of polynomials …， pn such that for every 3 ； in A ： 
inequality (5) holds with some p = pi ，We introduce the abbreviation pi(C) = C/, 
!• = 1 ， ♦••，//; the state of affairs can be expressed as follows: for every y in K, 

!C/J~^ol < 1 ( 6 ) 

for at least one i. 

The point xq belongs to B\ according to definition (4) of K, Cxq € K. Therefore 
we may in (6) set y = Ca*o and conclude that for some i = i\, |C/, Cxo 一 ； col < 1. 
By definition (3) of B, this means that the point C/, Cxq belongs to B, and so, by 
definition ⑷ of 尤， CC/,Cxo e K. Therefore we may in ⑹ set )，= CC/, Cxq and 
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conclude that for some i = z*2» 

|C/ 2 CC/,Cro — jcoI < 

Repeating this argument, we conclude recursively that 

J^CC^C^o-.voj < 1. 

We obtain from this by the triangle inequality that 

f[(C/ ; .C)^o| > l-t 0 | - 1 ； (7) 

note that according to (2), the right side is positive. 

Since the C/ are polynomials in C, they commute with each other and C, so (7) 
can be rewritten as 

^flC ( ^C%|>|,r 0 |-l. (70 

Denote the largest of the norms of Q by c: |C/| < c,i — I ， … ， N. Then it follows 
from (7’) that 

Taking the nth root and letting n oo, we conclude that 

lim |C n | 1/n > 

/I—OO c 

According to spectral theory (see theorem 4, chapter 17) the quantity on the left 
is the spectral radius of C. Since |cr(C)| > 0, the spectrum of C contains points 
other than 0. But according to the spectral theory of compact operators (theorem 6 of 
chapter 21) such points.are eigenvalues of C of finite multiplicityr-The-eorresponding - 
eigenspace is invariant under C, in contradiction to our supposition. This proves 
theorem 1. □ 


25.2 NESTED INVARIANT- SUBSPACES= 


Once we have proved the existence of a single invariant subspace Y, we can, by 

invoking that theorem repeatedly, prove the existence of infinitely many invariant 

subspaces. Thus Y itself has an invariant subspace，and so does X/Y t which gives 

rise to an invariant subspace Z,YcZcX. -- 

In an interesting paper, Ringrose has explored such families of invariant sub¬ 
spaces. 












278 


INVARIANT SUBSPACES 


Definition. Let X be a complex Banach space. A collection of closed subspaces 
{M} of Z is called a nest if the subspaces M are nested, meaning totally ordered by 
inclusion; a nest will be denoted by the symbol AT. 

Definition. Let X be a complex Banach space, J\T a nest of closed subspaces of X, 
—C a compact map~of X intoitself. If each subspace in the nest J\f is invariant under 
^Cv-then^lis called an invariant nest for C. 


Theorem 2. Let C be a compact map of a complex Banach space X into itself. 

(i) There exists a maximal invariant nest N for C. 

—~Wn^ contains the trivial subspaceT{^Tan3rX ： •— ― - 

(iii) LetM^bea subset of N. The subspace N = fl{L: L in A/*o) belongs to M. 

Proof. The existence of a maximal invariant nest follows from Zorn’s lemma. The 
space N described in (iii) is obviously closed and invariant under C. Take any K in 
A/*; if some L in A/o is contained in K, so is N. Otherwise, every L in Nq contains 
K, but then so does N. Therefore, since A/" is maximal, N must belong to J\f. □ 

Theorem 2’ (Ringrose). Let Cbea compact map of a complex Banach space X into 
itself, M a maximal invariant nest ofC. Denote by M any of the closed subspaces 
contained in the nest Af, and denote by M- the subspace 

M 一 = U{L inJ\f: L properly contained in M}. (8) 

By maximality, M_ belongs to the nest J\f • 

(i) The quotient space M/ M— has dimension 0 or ]• 

(ii) Suppose that dim M/M- = 1. Then C maps MjM 一 into itself as multiplica¬ 
tion by a scalar; denote this scalar by fi. If fx _ Q, fx is an eigenvalue ofC. 

(iii) Conversely, every nonzero eigenvalue y of C occurs as ijl for some MH 
The number of times y occurs as fi in M equals the algebraic multiplicity of 
y as eigenvalue ofC, that is t 

max dim nullspace of (yl — C) f . 
i 

Proof, (i) Since both M and M- are invariant under C, C maps M/M- into itself 
and is a compact map. If dim M/M- were > 1， by the Aronszajn-Smith theorem C 
would have a nontrivial invariant subspace in M/M-. This would correspond to an 
invariant subspace L of C in X， where L properly contains M 一 and is properly con¬ 
tained in M. Since AT is maximal, L belongs to A"; but this contradicts the definition 
(8) of M 一. 

(ii) /xl — C is the zero mapping of M/M- into itself. This means that /xl — C 
maps M into A/—. Since C restricted to M is compact, and /i ^ 0, it follows from 
theorem 5 of chapter 21, that the nullspace of (/ii — C) in M has the same dimension 
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We saw in chapter 22, equation (15)，that this map is compact but has no eigenvec¬ 
tors; therefore its spectrum consists of the point 0. On the other-handj-the-dosed - 

subspaces C a of C consisting of those / that vanish on [0, a] are obviously invariant _ 

forV. 

Exercise 4. Prove that the subspaces C fl , 0 < a < 1 forma maximal nest. 

.Brodsky and Donoghue-have,-independently,-proved-in the. iLr. topology-the_con- - 

verse of the propostion above: 

Theorem 3. Define H to be the Hilbert space L 2 [0,1], and define H a to consist 
of those functions in H that are zero on the-isubmteji}ar[0^a ], 0- - 

integration operator V defined by (9) maps H into itself，and every H a into itself. 
Conversely, V has no other invariant subspaces. 

Proof. (Donoghue) We start with a brief discussion of the convolution of L 1 func¬ 
tions on the real line. The convolution / * g of two L 1 functions on the real line is 
defined by 

(/ *^)(0= f f(s)g(t-s)ds. (10) 

.. J □ 

We have shown in section 6 of chapter 19 that the convolution of two L 1 functions 
is L\ and that convolution is associative and commutative. 

Suppose that both f(s) and g(s) are zero for ^ large enough negative, and denote 
by if, respectively l g the lower end of the support of / and g: 

If = sup{£: f(s) = 0foTS<e}. (11) 

It follows from this definition that the integrand on the right in the definition (10) 
of convolution is zero when t < 名 y + for either one or the other factor is zero. 

This shows that (/ * g)(t) = 0 when7 < £/ + 心， from which it follows that 
> if + lg. According to the Titchmarsh convolution theorem the sign of 
equality holds: 

+ 之 g. (12) 

A proof of this remarkable theorem will be given in chapter 38. 

"The mapping V defined by (9) it can be expressed as a convolution: 



where / is set to zero outside the interval [0,1], and the function h is the Heaviside 
function 


h(s )= 


for ^ < 0 
for 0 < 


(14) 


Note that the right side of (13) is defined on all of E; its restriction to [0,1] equals 
V/. It follows from the associative property for convolution that for any natural 
number n, 


Y'f = h (n) * f, (13，) 

where is the ;?-fold convolution of h with itself. An easy calculation, already 
performed in chapter 20, shows that 


h (n Hs)= 


>/n! 


for ^ < 0 
forO < s. 


(140 


The nontrivial part of theorem 3 is that the only invariant subspaces of V consist 
of all L 2 functions that vanish on an interval [0, a]. This follows from 


Lemma 4. Denote by f any function in L 2 [0, 1]. The set of functions f, Yf, V 2 /, 
… span L 2 (t, 1), where € = 

Proof, Suppose that g in L 2 [0, 1] is orthogonal to V 71 /, n = 0, 1,.... We can, 
using (13’)，write this condition as 

(h {n) *f,g) = 0, n = 0, 1,... (15) 

where (,) denotes the scalar product in L 2 [0,1], Now define the function g- in 
L 2 (-l,0)by 


g-(s) = g{-s). (16) 

For anxJfunction k in L 2 [0,1], we can write the Lr scalar product as 

(k, g) = (k * g_)(0). (17) 

Using associativity of convolution and (17), we can write condition (15) as 

(/i w */*g_)(0) = 0; 

using (17) once more，we find that this is equivalent to 

(化)，/*尽一 ) = 0. (150 

Note that /*g_ is supported in [—1，1]，and h^\s) = s n /n\ on the interval [-1,0] 
—j - n c: 一一 Weierstrass approximation theorem polynomials are 
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dense in L 2 [—1,0], it follows from (15 7 ) that /* 容- = Oon [—1, 0]. We can express 
this by the inequality l > 0. According to the Titchmarsh convolution theorem, 
it follows that 

+ > 0 , 

which implies that 茗二⑺ = 0 for f f. By definition (16) ofg- this is the same 
as 


g(s) = 0 for s > if. 

Since g is any function orthogonal to all V /! /, this shows that_the orthogonal cor^)le- 
ment of the span of {V 71 /} is contained in L 2 [0, £]. It follows that the span of{V n /} 
contains L 2 [£, 1]. On the other hand, since*each function V n / is zero on [0, £], it 
follows that the set {V n /} spans L 2 [i, 1]. □ 

Theorem 3 follows easily from lemma 4. To see this, let F be a closed invariant 
subspace of V in L 2 [0,1], Given any / in Y, V/, V 2 /, and so on, also belongs to Y, 
and so by lemma 4, Y contains Lr[if, 1]. It follows that Y contains L 2 [a, 1], where 

a = inf If. 
fmY J 

On the other hand, by definition of a, f{t) =0forr < a for any / in Y» This shows 
that7= 乙 2 0，1]. □ 

The Aronszajn-Smith theorem has been extended in several directions. Robinson 
and Bernstein have shown, using Robinson’s nonstandard analysis, that an operator 
T has an invariant subspace, provided that there is a polynomial p such that p(T) 
is compact. Lomonosov has shown that every operator that commutes with a com¬ 
pact operator has an invariant subspace. Whether every noncompact operator has an 
invariant subspace has been an open problem, until Enflo exhibited a Banach space 
X ， especially constructed for this purpose, and a linear mapping of X into X that 
is irreducible, meaning that it has no nontrivial invariant subspace. Subsequently ir¬ 
reducible linear maps were exhibited for more familiar spaces but none in reflexive 
spaces. It is in particular an open question if there are irreducible operators in Hilbert 
space，and it is an open question whether this question is interesting. 

In chapter 38 we will present Beurling，s description of all invariant subspace of 
the right shift operator R acting on £ 2 . 
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HARMONIC-ANALYSIS 

^SF^SSfcFfcl-NE—-_ - 


In this chapter we refine the technique described in chapter 21 for the study of com¬ 
pact operators, and use it to deduce the exponential decay of a class of functions. 


26.1 THE PHRAGMEN-LINDELOF PRINCIPLE 
FOR HARMONIC FUNCTIONS 

The maximum principle of the theory of analytic functions has a classical general¬ 
ization due to Phragmen and Lindelof for analytic functions defined on domains that 
stretch to infinity. The assumptions are that the function is bounded on the boundary 
of the domain, and that it is of limited growth as z approaches infinity. The con¬ 
clusion is that the function is bounded in the whole domain. This principle has an 
analogue for harmonic functions which we now state and prove: 

Let h(x ， y) be a harmonic function defined in the half-strip —1 < jc < 1, 0 < 
and continuous up to the boundary. We assume that 

(i) h(±hy) = 0fory>0. 

(ii) \h{x, )01 < const. e ly in the half-strip，where l is some positive number less 
than 丌 /2. 

We claim that then \h(x, y)\ < const, e 一 my k in the half-strip, where m is any number 
less than n/2. 

Proof. We want to compare h to the auxiliary harmonic function 

k 八 x' y) = Acos mxe^ ny + € cosmxe my , (1) 

where m is some positive number, 
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Because of the upper bound placed on m, the function cos mx is positive on —1 < 
jc < 1; choose A in (1) so large that for all x on [—1,1], 

1^(^, 0)| < A cos mx. (2) 

We claim that \h{x,y)\ < k € (x t y) for all x, y in the half-strip, for any positive 
choice for To see this, we note that 

-k € (x,y) < h(x t y) < k € (x,y) (3) 

for y = 0,x in [—1,1], since A has been so chosen. Furthermore (3) holds for 
'y > 0 t x = ±1, since by hypothesis (i) h is zero there, while k € is positive. Finally 
we note that (3) holds for y = Y large enough, |^:| < 1, since by hypothesis (ii) 
h grown at most as e ly , while k € grows as e my , and m > i. Applying the classical 
maximum principle to the rectangle 0 < y < y, —^<x< 1, we conclude that 
(3) holds inside the rectangle as well. Since Y is arbitrary, (3) holds in the entire 
half-strip for any positive value of e. Letting € tend to zero, we obtain that 

\Mx iy )\<Ae^ n y 

in the entire half-strip. This proves that h decays exponentially as y tends to oo. □ 


26.2 AN ABSTRACT PRAGIVIEN-LINDELOF PRINCIPLE 


This section is devoted to the statement and proof of a sweeping generalization due to 
the author of the Phragmen-Lindelof theorem stated above. It is based on a functional 
analytical abstraction of the space of harmonic functions. 

Definition. Let X be a Banach space over C, 5 a linear space of locally integrable 
functions «(}；) on the positive reals y > 0 whose values belong to the Banach space 
X. We impose the following two conditions on the space S: 


(i) 5 is translation invariant, that is, if u(y) belongs to 5, so does u(y + t),t any 
positive number. 

rb 

\u\ b a = Hy)\dy, . (4) 

where 0 < a < Z?. We require that the unit ball in 5 in the \u\ b a norm be 
precompact in the |w| 忠 -norm whenever [«o» ho) is~a~coi5pacl~s"ubinterval of 
[a, b], that is, a < ao < bo < b. 
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A similar definition can be given in terms of the L^-norm 

M 卜 ( 乂 6 \u(y)\ p dy^j (40 

in place of the L ] -norm. 

- Ppoperty callQd interior compactness. . . . . - - 

If we adjoin to S all limits of sequences of functions in S that converge in the L 1 
sense on compact subsets of R+，the extended space retains both properties (i) and 
(ii). Therefore we may assume without loss of generality that (iii) S is closed in the 
sequential topology described above. 

Example L Take X to be the space of continuous functions on [—1 ， 1] that vanish 
at the endpoints. Take S to consist of all functions u(y) = h(x 、)，)，where h is any 
harmonic function in the half-strip x e [—1,1], J > 0, continuous up to the bound¬ 
ary and equal zero for x = 士 1. It is not hard to show that the space S defined in 
example 1 has properties (i) and (ii). 

Example 2. Let L be a linear elliptic operator of order 2/2, whose coefficients are 
independent of one of the variables, call it y. Let G be a domain in the space of the 
remaining variables, call them x, whose closure is compact and whose boundary is 
smooth. Take the Banach space X to be the space Hq of functions in G with square 
integrable derivatives of order n, whose derivatives of order / = 0,1，…， n — 1 are 
zero on the boundary of G. Take S to consists of all functions of the form u(y)= 
h(x, y), where h(x, y) is any solution of Lh = 0 in the half-cylinder G x whose 
values for fixed y belong to X. 

Using the theory of elliptic equations we can show that a space S defined above is 
interior compact. 

Example 3. X = C,S the space spanned on R+ by a set of exponential functions 
where the /x" are real, positive, and < oo. According to Mtintz’s 

theorem (see chapter 9) S is a proper subspace of the space of continuous functions 
on E+ that vanish at oo. These spaces were studied by Laurent Schwartz; the interior 
compactness of such a space follows from properties established by Schwartz. 

We state now our abstract Phragmen-Lindelof principle. 


Theorem 1. Let S be a translation invariant, interior compact space of vector¬ 
valued functions on M+. Then there exh * 一 ””，•” -- ’ 

the space S, such that every function u i 
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also satisfies 


\u{y)\e cy dy < oo. 


Proof. Denote by K the subspace of S consisting of integrable function u(y); K 
is a Banach space under the L^norm 


M = j 0 隐 


Denote by T(r) the translation operator acting on K : 

(T(t)u)(y) = u(y + 1), 


(5) 


⑹ 


Clearly, the operators T(0 are contractions, that is of norm £ 1. Therefore the spec¬ 
trum of each T(/) lies in the unit disk. The key to theorem 1 is to show that the 
spectrum of T(f), r > 0, lies inside the unit disk. 

The proof is somewhat elaborate and is based on 12 lemmas and 2 propositions. 
In what follows T denotes any of the operators T(r), r > 0. 


Proposition A. Every nonzero boundary point X of the spectrum of T that lies inside 
the unit disk: |X| < 1 and k _ 0, is an eigenvalue of finite multiplicity. That Is ， 

(7) (T — 入 ) has a nontrivial nullspace of finite multiplicity. 

(ii) The space of all generalized eigenfunctions is finite dimensional 

(iii) Denote by N the space of all generalized eigenfunctions. The mapping T 一入 
of K/N into K/N is invertible. 

(iv) k is an isolated point of the spectrum of T over K. 

Proof. We need a series of lemmas: 

Lemma 1. Let X be a complex number inside the unit disc, |A.| < 1, and 人户 0. Let 
{w n } be a bounded sequence of functions in K such that for some positive integer k, 

入一 lim (T — \) k u n = v. (7) 


r ..Jhm.a,siibsequence of u n converges strongly in K, and the limit u satisfies 
(T 一 k) k u = v. 

Proof. It suffices to prove the result fork = 1. We have assumed that the sequence 
bounded: |«„| < M. Therefore 

— \ u n\ S 
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Since the functions u n lie in an interior compact space, there is a subsequence, again 
denoted as {“,,}，that converges in the |“| / 2/ -norm: 

..._ Wn — u m \f l 0. (8) 

We claim that the sequence [u n ] converges in the |«|g-nonn for any a. For according 
to (7) with/: = 1; 'converges in-llie-L 1 -norm; Since (Tw„)()，）= u n (y-bt) 

converges in the | |Q-norm; 

- i t 』 她 ex fo U 晒 扣观 vergeU^ Repeating this argument-TV：- 

times we conclude that {u n } converges in the | ’-norm，as asserted. 

To complete the proof of lemma 1, we show that the functions u n have uniformly 
- ― small L 1 norms-near oo. We take the algebraic identity 

T m = ， + 1 (T - 入） 

7=1 

and apply it to w , 卜 Denoting (T - 入 ) = u", we get 

m 

T m u„ = X m u n + J2 入 V ⑼ 

1 

By hypothesis (7), v n = u + , where \e n \ 0. We rewrite the identity above as 

一 mm 

r n U n =J! n u„ +J2 X，n ~ j ^ J ~ 1 ^ + (90 

1 1 

We claim that as m and n tend to oo, the right side of (9’）tends to zero in the norm 
of K. The last group of terms can be estimated as 

I )T 卜 、 I £ f] \X\ m -j\e n \ < 

1 1 — I 入 I 

To estimate the next group of terms, we note that = dj tends to zero as j 

tends to oo; therefore 

m 

一 1 

tends to zero as m tends to oo. Finally \k m u n \ < \X\ m M tends to zero as m tends 
to oo. So we conclude from (90 that given any positive €, 

\T m u n \ < € 


for all n and m large enough. Since 
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this proves that the functions u n are uniformly small at oo. Hence the subsequence 
{u n } converges in the L^norm not only over finite subintervals but on the whole 
positive axis. □ 

Lemma 2. Let T be any bounded, map of a Banach space K over C into K, and let 
\ be a boundary point of the spectrum of T. 

(i) There exists a sequence of unit vector u n such that 

|(T 一入 ) (10) 

(ii) The image of K underT — kis a proper subspace of K. 

Lemma 2 is a restatement of theorem 2 of the chapter 20. □ 

We turn now to proving part (i) of proposition A. Let 入 be a boundary point of 
the spectrum of T ， | 入 | < 1 and 入 ^ 0. According to part (i) of lemma 2, there is a 
sequence of functions {«„} in AT, |w ;I | = 1, that satisfies (10). According to lemma 1, 
a subsequence converges to a limit u that satisfies (T 一入 ) w = 0, Since a limit of 
unit vector is a unit vector, u is an eigenvector; this shows that 入 is an eigenvalue. 
Denote by N\ the space of eigenvectors with eigenvalue X; we claim that it is finite 
dimensional. To see this, we note that (T — X)u = 0 means that 


u(y^t) = ku(y). 

It follows that for an eigenfunction u. 



\u(y)\dy = (IXf- 1 


+ \M) \u(y)\dy, 


= (|xr r +l + |A|)|«|^. 

This shows that for eigenvectors the | norm, and the | \j l norms are equivalent. 
But the unit sphere in the former-nGFm-is-by~interior-GompaGtness*preGompact-in- the- 
iatter norm; that makes the unit ball of N\ in the | |^-norm precompact. But then 
according to theorem 6 of chapter 5, N\ is finite dimensional. This completes the 
proof of part (i) of proposition A. 


-the- 


iitg-genefafeedr-eigenfenetiensT-defified- 
,^nullspaee-/¥jt-of-the=operator-4T^X-)- r fe^ already:-noted-in- chapter 2^. 

theorem 2, if N\ is finite dimensional, then all the nullspaces Nfc are finite dimen¬ 
sional, and furthermore their dimensions satisfy the inequality 


dim Nfc — dim > dim — dim 


( 11 ) 


This inequality derives from the fact that (T — 人 ） maps one-to-one into 

N k /h 















-dmrNk+\/N k =mm 7T0N k - 


k > i 


an 


Proof. A nonincreasing sequence of nonnegative integers such as dim A^+i/A^t 
eventually consists of the same numbers. _ _ — □ 


Iem 慰 Htfollo 爾她 tnT 

*onto ~tz . . ... . . .. 




- a ； id a oounaea map oj yv/yv/^j onto /V/7V/. 
Part (i) is purely linear algebra: 
art (i) of lemma 4. 










































a boundary point of the spectrum. Therefore 
any nullsequence {^ n } of positive numbers, 
of functions in K for which the annHent 






|(T 一 A.Ju/jIq < ^/i|u/i|q. 


We choose now a ~nt, and denote 


We claim that for all integers k < 


l^ilo^ — ^ ~ (180 

For = 1 this is the normalization (18); for fc > 1 it follows inductively form (17"). 
So we have for k = n that 

- < w + (/2 — l)€ n A n . 

Since we have chosen ^ = 1/n 2 , it follows that < In. Setting this into (17 ")， 
we get 


l(T 一入 二 . 


(18") 


% 





+ 0 = -v t 


As t tends to zero, these functions v t oscillate more and more rapidly, and clearly 
violate interior compactness. Hence X = 一 1 does not belong to the spectrum of 
T(r) for? small enough. □ 

Lemma 9. The spectrum of T(0 in 0 < |X| < l is a discrete set of points accumu¬ 
lating at most at the origin or on the unit circle. 

Proof. We have shown that for t small enough 入 =—1 belongs to the resolvent 
set of T(0. Therefore some open set around 入 = 一 1 belongs to the resolvent set. 

According to proposition A, the boundary points of the spectrum of T(r) inside 
the unit circle form a discrete set of points that acumulate at most at the origin or 
on the unit circle. We claim that all points of the spectrum inside the unit circle 
are boundary points. Suppose on the contrary that there were a point that is not a 
boundary point Such a point could be connected to 入 = 一 ] by a polygonal path that 
avoids the discrete set of boundary points, but that is a contradiction, for such a path 
must contain a boundary point. This proves lemma 9 for t small enough; for any 广 it 
follows from T(/) = T m (t/m) and the spectral mapping theorem. 

Let p be any positive number; denote by the set of functions of the form 

u^(y) = e-^uiy), winS. (19) 

Clearly, is translation invariant, and it is interior compact, since for a and b 
finite, 

e- pb \u\ b a <\u^\ h a <e-P a \u\ b a . (190 

We define the space as the subspace of consisting of integrable functions 
is a Banach space under the L^norm. 

From here on T denotes the unit translation T(l). 
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Proof. Recall that every A. in the spectrum of T over K is an eigenvalue in 
inse that there is an eigenfunction u satisfying 

u(y+ 1) = My )， 

or I 人 I < 1 the eigenfunction u belongs to K, and for \k\ = 1,« belongs to 
I earl v. u(P) = e^Pyu belongs to and satisfies 



- , j different from 

these discrete set of values, and arrange the eigenvalues Xj of T acting on in 
decreasing order in absolute value: 

1 > I 入 ll 2 I 入 2I 2 … — 0_ 

With each eigenvalue Xj we can define a projection Py, 

= f (C-Tr 1 ^. _ 

JQj 

where Cj is a circle around Xj that contains no other point of the spectrum of T. 


(i) The operators V j are pairwise disjoint projections: 

P ； = P 7- P 7' P * = ° f or j _ k . 










K ⑼; using the decomposition (22), we can 1 
u=^ePyfj 


Now choose the integer m so large that \X m 
follows from ineaualitv Y21Y that for it large 


i ms snows mai me last ierm on me ngnt 
rest of the terms e py fj decay exponentially, 
ized eigenfunctions of T with eigenvalues \ 
less than 1, for otherwise they and their sui 
contradict the fact that u belongs to the snac« 
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are fulfilled for all kj that satisfy \Xj\e p < 1. It follows then from (220 and (23) that 
e cy u(y) is integrable. This completes the proof of theorem 1. □ 

26.3 ASYMPTOTIC EXPANSION 

We show now how to use the results derived for the proof of theorem 1 to give an 
asymptotic description of functions contained in the space K. 

Lemma 13. 


(i) There is a basis for the eigenfunctions of T(l) over K consisting of exponen¬ 
tial functions, that is, functions of the form 



Denote by {jLty} the set of all fM that appear in an eigenfunction (24) of T(l)；-it 
follows from proposition B that the real parts of \Xj tend to 一 oo. 


Theorem 2. Let S be as in theorem 1, and K the space of those functions in S that 
are integrable. Every function u(y) m K has an asymptotic expansion of the form 

U(y )« 勺， 


(27) 













. ej = Y^y kw J ， k- (270 

Proof. Decompose u as in equation (22)，with p = 0. Use of the formula for the 
容 proves the asyiliipfotic character of the expansion (27). □ 

_We return to example 2 .of.e11ip.tic..Qpfirators of.order.2/tof.form - - - - -. • 

In 

L = Yl A J d y n ~ j < (28) 


where Ay = Aj(x, dx) are linear partial differential operators of order j in the 
variables jq ， • •. ， x nu with coefficients that may depend on x but not on y, Ellipticity 
means that 

Li^ri) = Y / A j (x, > cQ^+r, 211 ) 

for all 工 ， c some positive constant. A typical example is 
L 0 = a 3 2n + A；. 

Let G be a smoothly bounded domain in : c-space，whose closure is compact. We 
take S to be the space of all solutions u = u{x, y) of Lm = 0 in the half-cylinder 
GxK + , which satisfy so-called coercive boundary conditions on 3 G. We take these 
boundary conditions the same for all y\ the simplest example is Dirichlet boundary 
conditions. We regard these solutions as functions u(y) whose values belong to the 
Banach space of functions of x in G, satisfying the boundary conditions, normed by 
the Sobolev norm 

J c \ d x u \ 2dx - 

It follows from the theory of elliptic equations that the space S of these functions 
u(y) form an interior compact space. 

Consider all solutions of Lu = 0 in the half cylinder G x M + whose dependence 
of y is exponential: 

uix.y^^e^wix). (29) 

Setting this in (28)，we conclude that w satisfies the equation 

(2> 2, 卜〜 ) 边 = 0 ， (30) 

and w satisfies the prescribed boundary conditions. Applying theorem 2 to this con¬ 
crete situation yields the following result. 
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Theorem 3. 

(i) The set of values 糾 for which (30) has a nontrivial solution w satisfying the 
boundary conditions is a discrete set in the sense that each strip 

a < Re fXk < b 

contains only a finite number offi^ 

(ii) Each solution of Lu = 0 that is integrable in the half cylinder has an asymp¬ 
totic expansion of the form 

m % [ e w - v 叫 (y ) ， RefMk < 0, 

where w^{y) is a polynomial in y. 

Exercise 3. Take 

l = + a?) 2 , 

G the interval [0, 丌 ]， and w subject to Dirichlet boundary conditions 
u = u x = 0 when .r = 0 or it. 

Show that ii in (30) satisfies the equation tan 2 \itz = (/xjt) 2 /[1 4 - (/xjt)] 2 . 
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INDEX THEORY 


We start with stating the main results of index theory in chapter 2. Let U, V be linear 
spaces, in general, infinite dimensional. A linear map T : U V is said to have 
finite index if it has these properties: 

(i) The nullspace Nj of T, is a finite-dimensional subspace of U. 

(ii) The quotient space V/Rj：, Rt the range of T, is finite dimensional. 

For such an operator we define the index as 

indT — dim Nj ： - codim Rj：. (1) 

A map G from one linear space into another is called degenerate if its range is 
finite dimensional. We recall from chapter 2 the following results: 

Theorem A. A linear map T : U V has finite index iff T has a pseudoinverse, 
that is, a linear map S : V U such that 

ST = I + G, TS = I + H, (2) 

where I denotes the identity in U and V respectively, and G, H are degenerate maps. 

Theorem B. Let T •• U — V and R : V W be linear maps with finite index. 
Then their product RT has finite index, and 

indRT = indR-f indT. (3) 

Theorem C. Let T : U V be a linear map with finite index, and G : U V a 
degenerate linear map. Then T + G has finite index, and 


ind (T + G) = indT. 


⑷ 
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27.1 THE NOETHER INDEX 

In this chapter we present a corresponding analytic theory dealing with bounded 
linear maps of one Banach space U into another V. As before, T has finite index 
if it has properties (i) and (ii). Condition (ii) implies, according to the closed graph 
theorem, theorem 14 of chapter 15, that the range of T is closed. 

The natural concept of pseudoinverse in this context is the following: 

Definition. Two bounded linear maps T : U V and S : V U are called 
pseudoinverses of each other if 

ST = I + K, TS = I + H ， (5) 

where K and H are compact maps of U, respectively V, into themselves. 

The analogues of theorems A, B, and C hold in the Banach space context, with very 
much the same proofs. There are three additional results with no counterparts in the 
linear algebra context. 

We will use repeatedly the basic result theorem 5 of chapter 21: 

Theorem Q. Let K : U U be a compact map. Then I + K has finite index，and 

ind (I + K) = 0 (6) 

The Banach space analogues of theorems A ， B，and C are theorems 1 ， 2, and 3: 

Theorem 1. A bounded map T \ U — V has finite index iff T has a pseudo inverse 
in the sense of definition (5). 

_ It is easy to verify that the pseudoinverse S constructed in chapter 2 for 
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From theorem 2 we can deduce 
Theorem 2 / . If T and S are pseudoinverses to each other, 
indT = —ind S. 

Proof. Applying the multiplicative law (3) to (5) we deduce that 

. . — maT + ihdS S irid Or+ K) . • ― 

According to theorem 0, ind (I + K) = 0, so (7) follows. 

The next result is due to Yood: h 

Theorem 3. Suppose that T : C/ V has finite index, and L is a compi 
map U V. Then T + L has finite index, and 

ind(T + L)= indT. • 

Proof. Since T has finite index, it has pseudoinverse S. Obviously S is i 
doinverse to T + L，for 

S(T + L) = ST + SL = I + K + SL ， 



dimN^ = codim 
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It follows from (8) that the nullspace of T’ is the annihilator of the range of T; 
this proves the first part of (10). Similarly (8) shows that the nullspace of T is the 
annihilator in U of the range of T\ This would prove the second part of (10) if U 
were reflexive; otherwise, it only shows that 

codim Rx^ > dimNx- 

Subtracting the first part of (10) from (10’）gives 
-indT 7 > indT. 

By (11), applied to S, 

一 indS’ > indS. 

Add this relation to (11): 

—indT’ 一 indS’ > indT + indS. 

According to (7), both sides of the above inequality are zero. However, this can only 
be if equality holds in both (11) and (1 1% and therefore also in (100. This completes 
the proof of (10); relation (9) follows. □ 

The next result, due to Dieudonne, is called the stability of the index: 

Theorem 5. If T : U V has finite Index, there is e > 0 such that for all bounded 

linear maps M : C/ V satisfying |M| < 


(100 

(ID 


S(T + M) = i 

Choose e = |Sr l ; then |SM| < 
invertible. Multiply (13) by (I + SIV 


(I + SM) -1 S(T 



jy uicurcm ^ oi cnapier u ， i -f oivi is- 
i the left: 

= I + (I + SMr l K. 

ejjMpT - 土上 

id(I-+SM)= l S.. (130 


srtible, multiplication by it doesn’t change the index: 


gives 


=indS. 


□ 






，?: 
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Then 

iim indT" =indT. (17) 

n-^oo 

For a proof, see Hormander, theorem 19.1.10. 

The following is an immediate consequence of theorem 6: 

Theorem 6 / . Let T(t) : U V and S(0 : V — U be bounded linear maps 
depending on a parameter ^ 0 < ^ < 1, continuously in the strong topology. Assume 
that S(t) and T(f) are pseudoinverses of each other: 

S(OT(r) = I + K(0, T(f)S(0 = I + H(0, 

where the maps K(t) and H(f) are unifonnly compact in the sense of (15), (J5’). 
Then indT(r) is independent oft; in particular, 

indT(0) = indT(l). 

NOTE. Operators with finite index are usually called Fredholm operators. There is 
no historical justification for this. It would be more appropriate to call operators with 
index ^ 0 Noether operators, and their index the Noether index, for Fritz Noether had 
given the first example of such an operator, defined the notion of index, and proved its 
stability, in the context of singular integral operators; see Hormander and Dieudonne. 
The important multiplicative property (3) is due to Atkinson and Gohberg. 

HISTORICAL NOTE. Fritz Noether was the brother of Emmy Noether; he fled to the 
Soviet Union to escape Nazi terror, and there fell victim to Stalin’s terror. 

We turn now to some examples of operators whose index can be calculated.ex.- 
piicitly. In chapter 30 we will show that in some cases the index can be expressed as 
the differences of the trace of two operators. 


27.2 TOEPLITZ OPERATORS 

We denote by L 2 (5 l ) the space of square integrable complex-valued functions u on 
'tfie unircircle.' If forras a"uricfer the L 2 -nonfi 

ll«ll 2 = J \u(ey\ 2 de .-：- . —… -—- (ts) . 

The functions e lkG form an orthonormal basis; every u in L? can be expanded as 

um = Y J U k e ike , 

—oo 


(19) 
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where the Fourier coefficients are given by ... 

- - - ~ ut= ju(0) e^ ike d9. . (20) 

The Parseval relation holds: ... _ . ____ 

■ - .. - ■ . 二 :二 - aai. P^-~= -.-a : .-.r-j.us..r. ::: _ 

_[ImII^V |Mtl - _(21)_ 

—oc 

Definition. H+ is the subspace of L 2 (5consisting of functions u whose negative 
-eeeffieieftts-are-^-er-e: . . - ^ - 

w in H+ iff Uf ： = 0 for k <0. (22) 

Given u in L 2 , of form (19)，we define its projection P+ onto H+ by 
oc 

P +M = I>〆 0 ， (23) 

0 

where w* are its Fourier coefficients. It follows from (21) that 

l|P+ll = I- (24) 

The space H- and the projection onto it are defined similarly. 

The space H+ consists of boundary values of functions analytic in the unit disk. 
The elements of H— are boundary values of functions antianalytic in the unit disk, 
that is, functions whose complex conjugate is analytic. 

Definition. Let s(6) be a continuous complex-valued function on the unit circle 5 1 • 
We associate with the function 5 the Toeplitz operator T 5 , mapping 7/+ into H+, 
given by the formula 

1>=P + Cs ： m )， u in H + . (25) 

In words, T s is multiplication by s, followed by projection into //+• We call the 
symbol of T s . Clearly, T s depends linearly on its symbol: T,s+ r =T S + T r . 

When we represent functions of class Hj- in terms of their Fourier coefficients, a 
Toeplitz operator becomes a truncated discrete convolution: 

00 

(Tjw)^ = s k—j l{ jt /r = 0, 1 ， • •. • （25’) 

;=0 

Here s n and u n denote the /?th Fourier coefficients of the functions s and w ， respec¬ 
tively. The semiinfinite matrix in (25’）has identical entries along each of its dexter 
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diagonals k — j = const. Such matrices are called Toeplitz matrices', they ari; 
naturally in discretizations of partial differential operators (e.g., see S. Parter a 
S. Osher) and in statistical mechanics (see McCoy). 

Theorem 7. Let s be a continuous complex valued function on 5 1 , T s the Toep 
operator with symbol s. Then 

(i) T 5 is a bounded map of H+ H+，and 



l|T,|| < klmax- 

(ii) Ifsis nowhere zero on S l f then has finite index. 

m 


indT^ = 一 WO )， 


where is the winding number ofs{9) around the origin. 


Recall that the winding number of a curve s(6) around 0 is the increase in the ar K 
merit of s {9) as 0 goes from 0 to 2n, divided by 2n. Analytically 

w(s) = im / s(dr l —de. ( 

Jo de 

Proof, (i) Multiplication by ^ is a bounded operation, with bound |j| maX} and 
is bounded, with norm 1. Since T s is the product of these two, (26) follows. 

(ii) We claim that T 5 ~i is a pseudoinverse of T 5 . To verify this, we need 


Lemma 8. For s continuous, 

C =jP+j -s ( 


is a compact map of H+ into L 2 . 

Proof. Since s is continuous, given any € we can approximate s uniformly b 
trigonometric polynomial so that 


















It follows from (26) that 

I|T V (/) < lk(0 -^(Ollmax ； 

since s(9, t) depends continuously on t, T s ⑴ depends continuously on t in the noi 
topology. Appealing to the homotopy invariance of the index, theorem 5"，we co 


lomDining mis wun ana me laenuncauon or n as me winaing numoer or s we 



In the course of proving theorem 7, we have shown that for the special function 
s t y(d) == e lNd , the dimension of the nullspace of Ty is either 0 or N, depending on 
the sign of N. This turns out to be true for all functions s. 

Theorem 10. Let s be a continuous, complex-valued, nowhere zero function on the 
unit circle S l , the Toeplitz operator with symbol s. Then 

(i) If W(^) > 0, then is one-to-one, and its range has codimension W(s). 

(ii) If VK(^) < 0, then has a nullspace of dimension W (j), and maps H+ onto 

(iii) If W"(5) = 0, then T s is invertible. 

Proof. We first prove part (iii); when the winding number of s is zero, s has a 
single-valued logarithm: 

s(&) = exp g(0 )， t{0) = logs (9). 

We split i into its analytic and antianalytic parts:-— - 

t = £ + in//+, LinBL. 

We assume at first that s is smooth, say C°°; then so is l and so are £+ and 
Exponentiate: 

s = — e ijr e l - = j+j— (32) 

-The-f 而 etion~ y^s --t-h€-boi ffldary - valuc ■ ★ 讀 - 池 al_ - 









Express s as s+s^., and divide by S -： 

s + u =sZ ] f + sZ ] g— •— 一一 ... 

Since ^Z 1 = exp 一 £ 一 ， the product 二 1 g - belongs to H-\ applying P+ gives s^u = 
P+^Z 1 /* Divide by s^.: 

.. mf.p 只 : J -六 —-.--- -( 32 l ) 


This shows that •yfPs 二 1 is the inverse of T 5 . 

— ..We .tum-now_.to-.parts (i) and..(ii)•„.Denote.the ； ,w.indiiigjpu.nib.er,,.of, s m by W..Tbg- 
function se^ 6 has a winding number zero;Jherefore the mapping u-^ f giveffby 


is invertible. This is the same as saying that T.? maps H^. one-to-one onto H+. 
From this (i) and (ii) follow. 口 


We remark that formula (320 is not only a theoretical tool but a practical method 
for inverting T s . 

We return now to the case when s is merely continuous, and W(s) = 0. For any 
e > 0 we can approximate s uniformly by a smooth function r so that 

I 厂一 * simax < (33) 

For e small enough 

|/- 1 5 -l| m ax<^. (330 

We draw two conclusions from this inequality: 


(i) It follows from (330 combined (26) that 

IIV.,-111 <i 

It follows from theorem 2 of chapter 17 that T r -i 5 is an invertible operator. 

(ii) It follows from (330 that r and 5* have the same winding number. Since we 
have assumed that WX«y) = 0, also W(r) = 0. Since r is smooth, it can be 
factored as in (32): 




where is the boundary value of 
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We claim that the operator T r -\ s can be factored as follows: 

T r-^ = T r ~« 5r -i = P+Klbr; 1 = P+dP+sP〆; 1 = T r : 

This is so because the operator P+ to the left of acts as the idem 
P+ to the left of s removes an antianalytic function that would hav, 
by the leftmost operator P+. As observed above, the operator T,._ 

invertible, so are the two operators T r -i and T r -i on the right, beca 

numbers of r + and r 一 are zero. It follows therefore that the third 
product on the right, T_y, is invertible too. This proves part (iii) of the 
The proof above is due to Gohberg, who pointed out that it also t 
wise continuous functions s, provided that there is some continuous 
that inequality (33’）is satisfied, with any constant on the right less tl 

- llmax < C, C < 1. 

The geometric meaning of the restriction (33") becomes clear if w 

sides by \r\ and write it as |r(0) — j(0)l < r(0), valid for oil 0. If r(6 

of a person taking a walk around the origin, and s(9) the position 
dog, held on a leash of variable length but always less then c|r(0)|, 
busily the dog jumps around in the circle to which it is confined by ti 
circle the origin the same number of times as its master. For a full disc 
to Gohberg and Krupnik, as well as the last chapter in Douglas. 

Krein and Gohberg have extended theorem 7 to n x n matrix-v 
S(9), acting by multiplication on vector-valued functions u(9). For 
note by H+ that subspace of the Lr vector-valued functions on S l 
Fourier coefficients are zero. P+ is orthogonal projection onto 
S(0) denotes a continuous matrix-valued function on 5 1 wh 
complex-valued functions. Since S(0) is a bounded function, the 
operator 


T s = P+S 

is a bounded mapping of H + H + . ' 


Theorem 11. Let S(0) denote a continuous, complex matrix valued, 
invertible at all points ofS 1 . 

(i) Ts : H+ H+ defined by (34) has a pseudoinverse. ... 

(ii) Since S(6) is invertible, s(9) = detS(0 )nonzero on S x ; we 



indTs = -W(detS). 

T s ~i is a pseudoinverse of Ts ； the proof is the same as in 
its index, we deform S into something simple, using the foil 


ical result: 
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Lemma 12. Two matrix-valued continuous functions on invertible at all points 
^ccu ： iJM~defamed~cmtimwusfy4nta~each~other-Mdthin this class of.functions iff the 
winding number of their detenninants are equal - 


Starting with any S, we deform it into S" of diagonal form: 


-SAT(0r= 




i^W:(defS-)r 


where all diagonal entries but the first are = 1. The matrix Toeplitz operator Tn, 
whose symbol is Syv above, is the direct sum pf scalar Toeplitz operators: its index 
-can~be-CQm putec i- CQmp oQentwise~l3yjQnmiia 
follows from the homotopy invariance of the index. 

Exercise L Show that for 


S(6) = 


dim Nr s = 3, codim Rr s = 


The preceding example shows that theorem 10 is false for matrix-valued symbols. 
When S(0) can be factoredas S = 5-S+, S- antianalytic, analytic, both invert¬ 
ible at every point of the unit disk, then 1 P+ 5 1 二 1 . Even when it exists, such 

a factorization can no longer be performed by taking logarithms. A method based on 
solving a system of PDE —s is given in Lax. 

In the proof of theorem 11 we have made essential use of topological notions 
and results. Conversely, notions and results from index theory are powerful tools in 
differentia] topology; a basic result of this kind is the Atiyah-Singer index theorem. 

An important extension of the theory of Toeplitz operators, replacing S l by the 
real axis, has been given by Wiener and Hopf. An extension to functions of two 
variables has been given by Strang. Further generalizations of the notion of Toeplitz 
operators are due to L. Boutet de Monvel and V. Guillemin, and to C. Berger and 
Coburn. In the farthest reaching generalization the notion of dimension of nullspace 
and range is taken in the sense of the dimension function in von Neumann algebras. 


27.3 HANKEL OPERATORS 

A companion to Toeplitz operators can be constructed by replacing the projection 
into H+ by projection into H -： 












If we represent functions of class and //-by their Fourier coefficients, a Hankel 
operator appears as 

(Hy^)^ = y ]s/c^.juj , /r = 0 ， l，... （ 35’) 

y=o 

Here s n denotes the (—;z)th Fourier coefficient of s, and uj,j == 0, 1,... the Fourier 
coefficients of u. Note that in order to maintain symmetry we have included the 
zeroth Fourier term in both H 七 and H— The semi-infinite matrix representing a 
Hankel operator has identical entries along each sinister diagonal k + j = const. 
Such a matrix is called a Hankel matrix. 

Exercise 2. Show that every Hankel operator is compact. 

Exercise 3. Show that the norm of a Hankel operator H. y satisfies 

IIH^rll 5 inf — I max ， （ 36) 

where q ranges over all analytic functions in the unit disk that are continuous on 
the unit circle 5 1 , and zero at 4 = 0. According to a theorem of Nehari, the sign of 
equality holds in (36). 

For further reading turn to chapter 19 of Hormander. 
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COMPACT SYMMETRIC 
OPERATORS IN 
HILBERT SPACE 


One of the most beautiful — as well as the most useful — results of linear algebra is 
the spectral theory of hermitean symmetric matrices. We recall that a matrix A is 
hermitean symmetric if it is its own adjoint: 

A* = A. 


The spectral theory of such matrices says that A has a complete set of orthogonal 
eigenvectors, and that the corresponding eigenvalues are real. This result has a perfect 
generalization, due to Hilbert, to hermitean symmetric operators in a Hilbert space 
that are compact. In this chapter we present this generalized theory, and give some 
concrete applications. 


Definition. An operator A mapping a complex Hilbert space H into itself is called 
hermitean symmetric — symmetric for short — if it is its own adjoint, that is, if for all 
x and yinH 

--- XAxry) 土 ( 乂，雄 （ l) 


Exercise 1. Show that a symmetric operator A as above is closed. Show that A is 
bounded. 


"Theorem trLet' A Wa symmetric operator: 

(i) The (hermitean) quadratic form (Ax t x) is real for all x in H. 

(li) The quadratic form (2) is not identically zero unless the operator A s 0. 


一伽矿 Setx 三 ' Xlsr( 1 }r 


⑵ 
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(A.v, x) = (x, Ax). 


: i 、 





Since the scalar product in a complex Hilbert space is skew symmetric, (2) is real. If 
CAx, X) = U for all x, then by setting x"zhyin place of x we deduce that the bilinear 
form- (Ax, )0 ==• 0 for all -x and y in H. It follows that Aa* = 0 for all x. □ 

Definition. A symmetric operator K mapping a 

positive if the associated quadratic form (Kx,a*) is nonnegative for every x in H. 
TKinS Mft 為初貧 oi Kr 一一 — 一 … 


Definition. Let A and B denote two symmetric operators mapping a Hilbert space 

H into itself. The inequality A < B (A less than B, B greater than A) means that 



Exercise 2. Show that the sum of two positive operator is positive. 


Exercise 3. Show that if A < 5^ and C < D, then A + C < +D. 

Strict positivity, and strict inequality are defined analogously. 

In this chapter we will study the spectral theory of compact symmetric operators. 
We recall from chapter 21 that an operator A : H H is called compact if it maps 
the unit ball in H into a compact set, that is, if the set [Ax, ||a.|| < 1} is precompact. 
We recall for convenience the notion of precompact; a set is precompact if its closure 
is compact. In a metric space this ean be expressed in two equivalent ways: 

Definition I. A subset R of a metric space is precompact if every sequence {z H } of 
vectors in R contains a convergent subsequence. 

Definition II. A subset R is precompact if for any e > 0 the set R can be covered 
by a finite number of balls of radius e. 

Theorem 2. A compact symmetric operator A maps a weakly convergent sequence 
[x n } into a sequence {Ax n } that is strongly convergent. 


Proof. It follows from (1) that if {x /z } converges weakly, also {Ax,,} converges 
weakly. A weakly convergent sequence is uniformly bounded: \\x n \\ < const. It 
follows from the definition of a compact operator that {Ax n } lies in a precompact 
set; so by definition I a subsequence of {Aa*"} converges strongly to some limit z. 
We claim that the whole sequence converges to z ； for it not, there would be some e- 
ball centered at z such that infinitely many Ax n lie outside it. By precompactness, this 
sequence has a subsequence that converges strongly to a limit^; clearly, || > 
But that contradicts the weak convergence of the whole sequence {Ax /7 ). □ 


2 state now the main result of this chapter, the spectral theorem: 

rem 3. A denotes a compact symmetric operator mapping a complex Hilbert 
、 H into Itself. Then there is an orthononnal base {z„} for H consisting of 
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eigenvector of A.: 

A.Z.n == (Xfj Z，n • (3) 

The eigenvalues a n are real and their only point of accumulation is 0. 

Proof. If A is the zero operator, any orthonormal basis will do. Suppose that A _ 
0. By theorem 2, the quadratic form (A.v, x) ^ 0, so it takes on some nonzero value, 
say positive. Look at the values of the quadratic form on the unit sphere ||.v|| = 1. 
Since A is a bounded operator, (A-r, x) does not exceed ||A|| on the unit sphere; we 
claim that it achieves its maximum there. To see this, denote by m its supremum: 



it \\z\\ were less man i, {Ay, y) ' 
(4) of m as the supremum. 

The homogeneous function 


oe greater man m, coniraaicung ine aenmuon 






ll-vll 2 


⑶ 


is called the Rayleigh quotient. Clearly, the vector z maximizes R\(z) among all 
















ERATORS IN HILBERT SPACE 


Since w is an arbitrary vector, Az — mz = 0 i.e. z.is an eigenvector of A, with 

-- - - — ___ 

— P- n ? e _ _V^-h? ve P rove _4 Jl e 吃 n_cp of one eigenvector, we can deduce the exis¬ 

tence of a complete set. This is based on the observation that for a symmetric opera¬ 
tor A, the orthogonal complement of an eigenvector is invariant under A. To see this, 
suppose that z is an eigenvector, and y is orthogonal to z\ then Ay is ortHogonalTo z ~ 
-— s •- -because- 二 : __ 二 從二 ， ： —— 二：； 二， 


(Ay 9 z) = (y, Az) = ()», Az) = 0. 

~~Il.folto.ws that if y is ortho 耸 onaLfiiXbiille—ctio ir {‘} of eigenvector, so is Av._ 

Now take {zni\ to'Be fhe collection "of all eigenvectors of A; denote by Y their 
orthogonal complement. As shown above A maps Y into itself, and of course, A 
restricted to Y is symmetric. Therefore, as shown above, unless Y consists of the zero 
vector, it would contain an eigenvector of A, a contradiction since Y is orthogonal to 
all eigenvector. □ 

Exercise 4. Show, using theorem 2, that the sequence of eigenvalues tend to zero. 

Exercise 5. Show that ifw — limx n = x, and if lim ||x„|| = ||x||, then x n converges 
strongly tox. 


The argument used to prove the existence of the first eigenvector was constructive. 
The same constructive argument can be used to furnish all subsequent eigenvectors. 
Arrange the positive eigenvalues of A in decreasing order: 


Then 


Az n = a n Ztu 0i\ > QT2 • • • > 0. 


a/v = max 


(A^c,x) 

—W 2 . 


⑹ 
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The vector that maximizes (6’）is the Nth eigenvector. The negative eigenvalues of 
A can be characterized by similar minimum problems. 

Suppose, as is often the case, that we are interested in the eigenvalues rather than 
the eigenvectors; then formula (6’）is not so useful, for this maximum problem in- 
volves explicitly the unwanted eigenvectors. Fortunately there are formulas, two dis- 
tinct ones, one due to E. Fischer, the other to R. Courant, that characterize the Nth 
eigenvalue of A without reference to eigenvectors of previous eigenvalues. 


be a compact symmetric operator; denote its positive eigenv 
⑽ sing order, bya kt k = l t 2,..see (6). Denote by R A (x) 









Similar pairs of variational principles hold for the negative eigenvalues. 

In a finite-dimensional space (7) and (8) hold for all eigenvalues, positive and 
negative. In this case the two principles are equivalent; (7) applied to 一 A gives (8). 

In an infinite-dimensional Hilbert space (7) and (8) are distinct; (7) can be used to 
give lower bounds for the Nth positive eigenvalue，whereas (8) can be used to give 
upper bounds. 
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all X ； the conclusion now follows from either Fisher's principle (7) or Cdilrant^ 
principle (8). □ 

For negative eigenvalues the opposite inequality holds. 

Exercise-6. —Show-thata compact positive-symmetric operator has no negative eigen¬ 
values. . .. .-. . 

In chapter 17 weTTave developed a functiiorial~calculus for elements of a Banach 
algebra, in particular, for bounded operators mapping a Banach space into itself. The 
class of functions / for which we were able to define /(A) consisted of the set of 
iimcdxms^a3ylicJ3D^^apen-setxo.n.t.ai.rii.ng ..the-spectrum of. A.„We .shawjio.wJhat. 
thanks to the spectral theory developed in this chapter, we can for A symmetric and 
compact, define /(A) for every / defined on the spectrum of A. 

Theorem 6. Let A be a compact symmetric operatoj ： We can assign to evety 
bounded complex-valued function f(a) defined on the spectrum of A an opera¬ 
tor that we denote as /(A), so that 

(i) the operator assigned to the function f(a) = 1 is the identity I. 

(ii) the operator assigned to the identity function f{a) = a is A. 

(iii) the assignment f /(A) is a isomorphism of the ring of bounded function 

on a (A) into the algebra of bounded maps ofH into H. 

(iv) this isomorphism is isometric: 

|i/(A)|[ = sup . \f(a)l 
o in a (A) 

(v) when f is real valued, /(A) is symmetric. 

(vi) when f is positive on the spectrum of A, so is /(A). 

Proof. The proof is shorter than the statement of the theorem. Denote by {<:„} an 
orthonormal basis consisting of eigenvectors of A, with eigenvalue a n . Express x in 
H in terms of this basis: 

x ― 〉 : ， (12) 

and define /(A) to act as follows: 

/(A)A- = 2/(a„)c nZ „. (13) 

Properties (i) to (vi) are now obvious. 


□ 










SPACE 


Exercise 7. Prove that the positive square root of A is unique, i.e. that there is no 
other positive operator whose square is A. 

Here is a no-cost extension of the spectral theorem for compact symmetric oper¬ 
ators: 

Theorem 7. Let {A^} be a collection of symmetric operators mapping a Hilbert 
space H into itself that commute pairwise: A y As = A^Aj,. Suppose that at least one 
of the Ay is compact; then there cm orthononnal basis {“} consisting of common 
eigenvectors of all the Ay： 

Ay Zn = (Xn(y)Zn- (14) 

Proof. Denote by A one of the compact operators in the collection, by a n its 
eigenvalues. Denote by S n the eigenspace corresponding to a n , that is, the space 
of vectors : satisfying 

Az = a n z- (14 ; ) 

It follows from theorem 3 that each S n is finite dimensional, that they are orthogonal, 
and that they span H: 

H = 5| ㊉ … • 

Each S n is invariant under all the other A y . To see this, let A y act on (14 ’)； using 
commutativity, we get 

Ay \z = AAy z = dfl A.J/ 

Restricted to S IU A y is a symmetric operator; therefore S n is the orthogonal sum 
of eigenspaces of A y . We take now another operator of the collection and de¬ 
compose each of these common eigenspaces of A and A y into an orthogonal sum 
of eigenspaces of A, 乂 ， and so on. Since is finite dimensional, this process must 

_ come to an end after a finite number of steps with a decomposition of S n into a sum 

of eigenspaces for all operators in the collection. Then we turn to S n +\ and repeat 
the process, and so on. □ 

- 盈從祁 ^&~Show r -wittic>ut-Eecaur*se-to-t:heorern-l v that.eigeiwectors-ofa.symmetric... 

…: operator belonging to distinct eigenvalues are orthogonal. 

Theorem 7 has the following important application to normal operators: 

Definition. An operator N mapping a Hilbert space H into itself is called normal if 
— —— N and its. adjoint commute: 


N*N== NN*. 
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Corollary 2. Every compact ndriiial operator has a complete set of diiHonoiindl 
eigenvectors. __ 


p roo f -Decompose N into the sum of its symmetric and antisymmetric parts: 

XT n „ N + N* T N-N*. 

N = R + J, where R = — 1 — — ， J 三 二二〒 ■ : : 二 T -- 

-—— Cleariy^is^symmetekvJ- aatisy- mmetriC r^ d N ， = R^J, S i nce^land-N^CQgmute，- 

so do R and J. According to Schauder’s theorem, theorem 7 in chapter 21, the adjoint 
N* of the compact operator N is compact; therefore so are J and R. 

We appeal now to theorem 7 and conclude that R and J have a complete orthonor- 
::i: 二： ^ clearly, these are eigenvectors of iN as well. CJ 

Definition. An operator U is called unitary if it maps H onto itself isometxically, 
that is, ||lk|| = M. 

Exercise 9. Show that a unitary map U satisfies U*U = I. 

Exercise 10. Let U be a unitary operator of form I+C, C compact. Show that U has 
a complete set of orthonormal eigenvectors, and that all eigenvalues have absolute 
value 1. 
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EXAMPLES OF COMPACT 
SYMMETRIC OPERATORS 


The spectral theory developed in the previous chapter is one of the workhorses of 
analysis. In this chapter we present a few examples. 

29.1 CONVOLUTION 

We take the unit circle 5 1 and form the Hilbert space H = L 2 {S^), namely the square 
integrable functions. Let a denote any complex-valued function of class L l (S l ), and 
define the operator A: H H rs convolution with a: 

(Aw)Cv) = [ a(y) u(x - y) dy. (1) 

JS f 

By a change of variables we can also write 

(Au)(x) = / a(x-y) u(y)dy. 

Js 、 

Theorem 1. ， 

(i) Evety convolution operator commutes with translation: 

(T c w)Ct) = u{x + c). 

(ii) Any two convolution operator commute. 

Exercise 1. Prove theorem 1. 

Theorem 2. 

- (I) 一 Prd 百 fimdltf (Yyis a bounded operator, and 

I|A||<N l i. (2) 
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(ii) A is q compact operator. 

(iii) A is a normal operaton - ; - 

(iv) If a satisfies . .. . ...—. . .— 

a(^x)^a(x), D 

then A is symmetric. 

’ lO'Approximate tfieTrifegfal diTtfie right or(T)'by a sum 

(Am) (x) ~ h D h)u(x-nh). 

- Usin_gfteMan’gle4ne 明 - 

/j^|a (;?/?) I ||«||. 

From this inequality (2) follows for smooth a and u by letting h tend to zero. For u 
in Lr and a in we use approximation by smooth functions. 

(ii) Suppose that ais in L 2 (S l ); then we can use the Schwarz inequality to estimate 
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to other compact commutative group. Thus in precisely this fashion Hermann Weyl 
succeeded in proving that the exponentials A real, are completelh the space of 
■ almost periodic functions.-For details, see the-F. Riesz and Sz；-Nagy text, Functional 
Analysis. 


29.2 TEEimWkMWXW^ 


Denote by L the second-order differential operator 



acting on functions on the interval [0,2 丌 ] and vanishing at the endpoints. Here q is 
a continuous real valued function, say bounded from below by 1: 


gM > l. (10) 

It is not hard to show (e.g., see chapter 7) that the boundary value problem 

Lw = /, «(0) = 0, u{2n) = 0, (11) 

has a unique solution u for any given continuous function /. We denote the depen¬ 
dence of this solution w on / by A: 

A/ = «; (12) 

in words, A is the inverse of L. We will show below that A as defined above is 
bounded in the L 2 -(0, 2n) norm, and thus can be extended by continuity to the whole 
Hilbert space H = L 2 . 

Theorem 3. The operator A as defined above is 

(i) bounded. 

(ii) compact. 

(iii) symmetric. 

(iv) positive with respect to the L 2 scalar product. 


Proof. For u twice continuously differentiable, multiply equation (11) by u and 
integrate over [0, 2n]. Using definition (9) of L, integration by parts gives 


J {u\ + q w 2 ) dx=z J u f dx . 


(13) 


We estimate the right side by Schwarz’ inequality, and use the arithmetic geometric 
mean inequality and restriction (10) on the left; we get 
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J u l + ^J u2 dx <^J f 2 dx. 

The boundedness of A follows from this. It also follows that A maps the unit ball in 
the L 2 norm into a set of functions u for which ||w. v || 2 < 1. According to Rellich，s 
criterion (see chapter 22) this set is precompact; this proves that A is a compact 
operator. 

(iii) The symmetry of A follows from that of L: for u, v twice differentiable，we 
get by integration by parts that 

(Lw, v) = (m ， Lu). 

Setting Lu = /, Li; =： g, we get 

(/’Ag) = (A/, 客 ). 

(iv) Since the right side of (13) is (A/, /), positivity of A follows. 

Since A as defined for bounded /is a bounded operator in the L 2 -norm, it can be 
extended by continuity to all / in L 2 . The extended operator retains all properties 
listed in theorem 3. □ 

Exercise 2. Show that for all / in L 2 , A / is continuous and is zero at 0 and it. 
(Hint: Use the estimate \\u x \\ < ^||/||.) 

We can now.apply the main result of chapter 28 to // = L 2 (0, lit) and conclude 
that the operator A has a complete set of orthonormal eigenfunctions [e n ] in L 2 : 

A% = cifi en . (14) 

Since A is positive, so are its eigenvalues. 

Using the result of exercise 2, we conclude from (14) that each e n is continuous. 
For such e, L is the inverse of A, so we apply L to (14) to conclude that 

- Le n =k n e n , X n = (14 ’） 


The eigenvalues a n of A tend to zero; it follows from (14/ that the eigenvalues 
X n of L tend to infinity_ _ 


29.3 THE INVERSE OF PARTIAL DIFFERENTIAL OPERATORS 


The analysis in the last section can be applied, with a few changes, to partial differ- * 
ential operators. The simplest case is 


L = —△， 


( 15 ) 
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the Laplace operator A = 8j 二 3/3A*y.Let G be a domain in IK” ， and consider 
the boundary value problem - - 

L« = / in G, w = 0 on the boundary of G. (16) 

U is a basic result of the theory of partial differential equations (see chapter 7) that for 
./ sufficiently differentiable and G smoothly bounded, this boundary value.probJem.^ 

- unique solution u. As before, w_e. denote thi$..s.oJut.iQn .w_a$_A/...The„an.al.Qgue 

of theorem 3 holds in this case too and is proved by the same technique, namely 
integration by parts. This allows the extension of A by continuity to all of H = 

L (G)，and the application of the spectral theorem of chapter 28 to conclude that A 
— has orthonofmal. set of eigenfmxrtf 加 s 叫； wit^pcxsitive^igOTvaiffesicejTr- 

The argument that was used to prove that A is a compact operator shows that A 
i‘s a smoothing operator. A not too technical argument shows that for a large enough 
々， A v is highly smoothing, that is, it turns any Lr{G) function into one that is as of- 
ten differentiable as we wish. Applying the operator A many times to the eigenvalue 
equation ke = ae gives A k e = a k e. It follows that the eigenfunction e is suffi- 
cientJy differentiable; therefore LA^ = e. Applying L to the eigenvalue equation 
gives therefore 

L e n = X n e n , X n = a /2 \ 

meaning that the e,/ areeigenfunctions of the differential operator L as well. 

This setup is applicable to more general elliptic operators. For instance, we can 
set 

L = - ^ 3/ a u dj + g, 

where (cijj) is a positive definite matrix whose entries are smooth functions of x and 
只 is a smooth nonnegative function. The boundary condition w = 0 on the bound- 
ar y may be replaced by others, for instance, the Neumann condition that the normal 
derivative of u vanish on the boundary. But note that whereas for the boundary con¬ 
dition u = 0 smoothness of the boundary may be relinquished, it cannot be for the 
Neumann condition. 

Thu Fischer and Courant variational principals for the eigenvalues of A can be 

Iranslaled into variational principles for the eigenvalues of L. They yield excellent 

asymptotic estimates for the size of the nth eigenvalue; see Hermann Weyl, and 

Courant quoted in chapter 28. 
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TRACE CLASS AND 
TRACE FORMULA 


A remarkable result of linear algebra is the trace formula, which says that the sum 
of the eigenvalues of a square matrix equals the trace of the matrix defined as the 
sum of its diagonal elements. In 1959 Lidskii showed that this relation is valid also 
for a large class of compact operator in Hilbert space. This result is rather deep, and 
its proof correspondingly tricky. Lidskii’s trace formula is a powerful tool in many 
branches of analysis. 

30.1 POLAR DECOMPOSITION AND SINGULAR VALUES 

Let // be a separable Hilbert space over C, and T some compact operator mapping H 
into itself. Denote the adjoint of T as T*; the product T*T is clearly a nonnegative, 
symmetric operator; according to the functional calculus described in chapter 28, 
T*T has a uniquely determined positive square root A = (T*T) V 2 . For any it in H, 

||T«|| 2 = (Tw,Tw) = («, T*T«) = («， A 2 w) = (Au, Ait) = ||A“|| 2 . (1) 

As we apply (1) to w — u ， we deduce that if Au = Ay, then Tu ― Tv. This enables 
_us_to..define the..operator U.on the range of A as follows: 

U : A« hi. (2) 


It further follows from (1) that U is an isometry on the range of A. 



:: men toft 


U /2 = 0 for/z 丄 /?. 

Since (U/i, v) = (/z, U*v) = 0 for n±R y and for all v, it follows that U* maps H 
into the orthogonal complement of 尺丄 ; thus the range of U* lies in the closure of ^ 
of ^"^WecTaim that 

U*Uiy = w for w in 'R, (3) 

329 

% 





To see this take any z and w in R. Since U is an isometry on R, it also preserves the 
scalar product of any two elements in ~R: 

(z，uO = (Uz ， Uu;) = (z ， U*UuO. 

——ItJollows that (z, U*Uw; — w) = 0. Since z is an arbitrary elements of R, 

_ __ U*Uu;- w; 丄 i?. 

Qn the other hand, we have shown that U* maps H into i?. So we see that for w in 
/? ， U*Uw; — w both belongs to and is perpendicular to it. Therefore ⑶ follows. 
We summarize the information contained in (2) and (3): _ — 

Theorem 1. Every compact operator T can be factored as 

T = UA t (2 7 ) 

where A is a positive symmetric operator, and U*U = I on the range of A. 

The operator A is called the absolute value of T, and (2 ; ) is called the polar 
decomposition of T. 

Theorem 1 is true not only for compact operators but all bounded operator. The 
oniy place in the proof where compactness was used is the construction of the square 
root of T*T. As we will show in the next chapter, every bounded positive symmetric 
operator has a square root, not just the compact ones. 

When T is compact, so is its absolute value A. The nonzero eigenvalues of A, 
denotes as [vy }，are positive numbers that tend to zero; we index them in decreasing 
order. The numbers Sj are called the singular values of the operator T, and denoted 
afi,v/(T). J 

Exercise L Show that for each j\ Sj(T) is a continuous function of T in the norm 
topology. 


30.2 TRACE CLASS，TRACE NORM, AND TRACE 

Definition. A compact map T of a Hilbert space H into H is in trace class when 
00 

J2sjC0<oo. ⑷ 

1 

This sum (4) is called the trace nonn of T: 

Pltr = J>(T). 

Exercise 2. Show that ||r|| < ||r||tr. 
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TRACE CLASS, TRACE NORM, AN 
The next theorem enumerates 
Theorem 2. Let T be a trace cli 


(i) l|T||, r =||T*|| tr 

(ii) l|BT||tr < ||B|| IITHtr 
(Hi) IITBUtr < ||B|| ||T|| tr . 








that 

J ; ;(TB) = sj(B*T*) < ||B*|| 2 ~(T*) = ||B|| 巧 (T). (5 ，） 

Summing over j we deduce (iii). 

To prove (iv),-w& establish the-following characterization of the trace class and 
: . trace norm^ . . — ___:•. . . 

Pll tr = Sffp-gtd7，i r^TlT. . - -―… 一 (6) 

n 

where the supremum is taken over all pairs of orthonormal bases {/,,} and {e n }. 
r:~~~^teve-t0-&hew^at-the^^sMe~of~^)^e^~e^eds 叫 |T|ttrrandequals-it-for-—- 
appropriate choice of f n and e n . * 

Denote by zj the normalized eigenvectors of the absolute value A: 

kz j =s j z h = 

For any vector /, we can expand, 

/ = A/ = Zj)Zj. 

j j 

Apply U to both sides; using the polar decomposition T = UA S we get 

T f = J2 s jQf ， z j) w j 、 ( 7 ) 

where Wj = U ". According to theorem 1, vjj form an orthogonal basis of the range 
of A. We take the scalar product of (7) with e: 

(T /, e) = ^sj(f,zj)(wj,e) . (7 ’） 

We set now / = /«, e = e n and sum over n: 

22(T/ n ，e„) = —(8) 

We claim that the double series on the right converges absolutely and is < ||T|| tr . To 
see this, we sum first with respect to n and apply the Schwarz inequality; we get the 
following estimate for the sum on the right in (8): 


j \ » « 


1/2 


By the Parseval relation 


|(/«>2j)| 2 = ll^jll 2 = 1. XI |(^;.«n)| 2 = Nyll 2 = 1- 

n n 


This shows that the right side of ⑻ is bounded by = l|T|| tr . 
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:: (ii) 

(iii) trT* = tirT. 

(iv) For any bounded operator B, tr T B = tr B T. 

Proof. Inequality (i) was derived in the course of proving the convergence of (8). 
■ Properties (ii) and (iii) follow from the definition (9) of trace. To prove (iv), we start 
with formula (7), and let B act on both side: 

—— ― 

SO 

wny=j^^jTj)(R'wj, w - 

Set / = f n and sum with respect to n. Reversing the order of summation and using 
the Parseval relation as in the derivation of (11), we get 

trBT=^^(T)(BK ； y )Z7 ). '(12) 

On the other hand, replacing / by B / in (7) gives 

TB / = ^>j(T)(B/ ， z 7 .) % = E 〜 (T)(/ ， B*~) 叼 . 
j 

Proceeding as before, we get 

trTB = ^^(T)(u)j,B* Zj ) = J^SjiTXBwj^j), 
which is the same as formula (12) for trBT. □ 


30.3 THE TRACE FORMULA 

The deepest, and most important, property of the trace was proved by Lidskii in 
1959: 

Theorem 5. The trace of a trace class operator is the sum of its eigenvalues: 

trT = [> ; _(T). (13) 

The identity (13) is called the trace formula. 

Proof. When T is a normal operator of trace class, we can, according to corollary 7 
of chapter 28, choose an orthonormal basis consisting of eigenvectors of T. By (9), 

tr T = y~^(T fn i //i)= > : 入 " ， 

proving (13). 
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For T not normal, the eigenvector are in general not orthogonal，and there may be 
generalized eigenvectors: 


T = 入 /j or T* = 入 /j 山找 + W/i —\. 

We can, by the Gram-Schmidt process，orthonormalize them; f n is a linear combi¬ 
nation of ,..., u;,i so that 

T f n == k n f n + linear combination of /i ， . • • ， /" 一 i ， 

Since the f n have been chosen to be orthonormal, 

(T fni fn) — 

Summing over all n would yield the trace formula, provided that the f n form a basis 
for the whole Hilbert space. They do if the eigenvector and generalized eigenvector 
span the whole space, but if they don’t, then the f n have to be supplemented by an 
orthonormal basis h m for the orthogonal complement of the eigenvector of T. The 
expression of the trace of T now reads 

trT = X ； (T/„，/„) + ^(T/i m ,/i m ) = E 入 ，, + ^2(Th m ,h in ). (14) 

The task is to show that the second sum on the right is zero. For this we need some 
lemmas. 

Lemma 6. Let T be a compact operator on a Hilbert space H，K the orthogonal 
complement of its eigenvectors and generalized eigenvectors. 

(i) K is an invariant subspace ofT*. 

(ii) The spectrum ofT* over K consist of the single point 入 = 0. 

Proof, (i) Let e be an eigenvector，possibly generalized, of T: 

Te^Xe + f, 

- /Tmrnto^en^alfee^i^enYeaoTran^wpirase'ihanri 

claim that so is T*«; for 

_ (e. Tu) = (Tg, u) = (Xe + /, u) = X(g t «) + (/；«) = 0. 

(ii) r According* tcrSchauder’s: theorem，ther adjoint T* of a compact operator is 
compact. If A. were a nonzero eigenvalue of T* on K, then X would be an eigenvalue 
of T in 丑 of finite multiplicity. According^to theorem 6 of chapter 21, there is an 
integer i such that the nullspace of (T* — X) 1 equals the nullspace of (T* — 

.but is larger Than the nullspace of (T* - X)^. Let u in /T be a member of the 

(T* — X)v = u 
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has no solution; for a solution v would belong to the nullspace of but "not 

to that of (T* - X ) 1 . According to the Fredholm alternat4vM^^^ni-8 of - 

there must be an eigenvector w of T, = 0, that is not orthogonal to w. But _ _ 

this is a contradiction, for w belongs to K. and so u is orthogonal to all eigenvectors 
ofT. . ■ □ 

If T is of trace class over H, so is T*. We claim that^Westricted-to-its dnvariant^^ - 

subspace iTis of trace dass-^this. follows immediately from relation <6) character- - 

izing trace class operators. 

Back to formula (14); the second sum on the right can be rewritten as 

^y^Ohn ，- 

Since h n { is an orthonormal basis of K. this sum is the complex conjugate of the 
trace of T* over K. Its vanishing can be formulated so: 

Lidskii’s Lemma. Let T be a trace class operator that has no eigenvalues, except 
zero; then trT = 0. 

The rest of this section is devoted to proving this proposition. We start with an 
estimate of the eigenvalues of a compact operator in terms of its singular values. 

Lemma 7. Let T be a compact operator，with nonzero eigenvalues 入 ] ，入 2 , . • ，⑴、 • 
ranged in decreasing order of their absolute value, including multiplicity. Denote as 
before the singular values ofTas Sj (T), arranged similarly. Then for any N, 

N N 

n i ； vi ^ n^ (T) - (i5) 

i i 

Proof. Denote by Ey the space spanned by the first N eigenvectors of T, and 
denote by Pyv orthogonal projection onto E^. Denote by the restriction of T to 
the invariant subspace £yy* Denote by the absolute value of T/v ： 

T" =Uyv A/v. (16) 

Since the eigenvalues Xj are nonzero, T" is invertible. Then so is U", and therefore 
U" is unitary. Taking the determinant of (16) gives 

IdetT^I = det Ay^/. 

Since the determinant of a matrix is the product of its eigenvalues, we can rewrite 
this identity as 

N N 




( 17 ) 












lue ofTP 

Ejj. It follows that 入 y(A/v) = .yy(T P/v ). j = I ， . … /V. We appeal now to inequality 
(5)’; Sj(TB) < ||B|| 〜 (T). Apply this to B = P,\r： we obtain 

.vy(TP <v ) <^-(T). (17 ，） 

Since we have shown that 入 j(A,v) = Sj(T P^), setting (17’）into the right side of 
(17) yields (15). □ 

We can deduce further inequalities between \kj\ and sj with the aid of the follow¬ 
ing simple principle: 

Lemma 8. Let a\ > an > - • and b\ > bn > - -be two decreasing sequences of 
real numbers, satisfying for each N 


N N 

l I 



z an arbitrary real number. For this choice of F inequality (18’）can be reduced to 


P Q 

. . J2 (a J - - :)， (L9) 

.. i I 

where 

ci] > z fory < P t cij < z for j > P 


- and 


bj >z for j < 0. bj < z for j > Q. 






To verify (19), we observe that the right side can be characterized as 


M 

(: bj—-z ) : 


For M = P, (bj - z) is, according to (18)，greater than the left side of (19), so 

"' even rhSre W 一一一 — 一〜 …. 一 ’.- 一 -— 

It was shoWTrimeetiWWTZT 肢 r6V6iy FtiVlemma 8 is a'convex combmafiOT" 
of the points on the extreme rays. Since both sides of inequality (18’）are linear 
functions of F. and since (18 ; ) holds for every extreme F, it holds for all F, □ 


We applylemmT?~to 

cij =log| ； v y -(T)|, bj = log^(T); 


taking the logarithm of (15) shows that inequalities (18) are satisfied for this choice. 
Choosing F(x) = e x , we deduce from (18’）that 


1 1 
Choosing F{x) = lcigtmf)，r > 0, gives 


( 20 ) 


n( 

i 


N 


( 21 ) 


To estimate the trace of T，we approximate T by finite-dimensional projections. 
Let {h n } be an arbitrary orthonormal basis of the Hilbert space; denote by P// or¬ 
thogonal projection onto the span of h\ , • • •, hj^. Denote by T 八 ， the projection of T 
onto the range of P^: 


Tyv = Pyv TP//. 


( 22 ) 


Lemma 9. Suppose that T is a trace class operator that has no nonzero eigenvalue. 
Denote Tln as above. Then 


(i) Tn approaches T uniformly: 

"l^nJITV-T 卜 0. 


(ii) limtrT A ； =trT. 

(Hi) denote the spectral radius o/Tyv by a^i tends to zero as N — oo. 


Proof, (i) is true for any compact operator T, and (ii) is the definition of trace. 
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(iii) By assumption T 一入 is invertible for every 入 # 0. Given any 5 > 0, denote 
m(8) = m the quantity 

m = max ||(T 一人厂 
、 !a|>5 

By (i) we can choose M(5) so large that for N > M{S), 

I|Ta/-T||<~. 

m 

For such N and |A.| > S, (Tyy 一 T)(T 一 入 ) 一 1 has norm < 1， so 

T^-A. = Tyv—T-fT-X = [(T/v — T)(T — 入 ) 一 1 +1] (T — 入） 

is invertible when \X\ > 5. Therefore < S. 口 

Denote the eigenvalues of as X { j N \ j = 1 ，…， N, Denote by D N the polyno¬ 
mial 

dnw =j~[ (i - ^y V) ) - 口 3 ) 

Lemma 10. 

lim D N (k) = e_ Xa , a = trT, 

iV—s-oo 

uniformly on every bounded set of complex numbers X. 

Proof. Take the logarithmic derivative of (23): 

入 (AO 

_ 命 — 

Since each |A.1 N) | is < a N , we can for \k\ < l/crw expand each term on the right as 
• a geometric series: 

. . - ^=- e £ xk ~ l ' f24) 

where 

y=i 


•i. 
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For k > 1we estimate 5 j [ A/, ； crudely, as follows ： Since each < afj ；\< 

^r 1 Y1 1 久 We apply now inequality (20) 
that 

\S ( k N) \ < 4- 1 IITa-IIu- < a^r 1 ||T|| lr . A* > 1. . (25) 

For 人 • = 1 we have 

.-.s{ A，) = .trT^._ ___ (25!} 

We rewrite (24) as 


D 八 . 


+ rrT^trT^-57 
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Using the inequality I r < on all but the first M factor on the right gives 
M / oc \ 

\e- }x, < n ( 1 + 1^1 X J) ex P 1^1 E S J = P M e，X| e ' W - (26) 

I \ M-t\ 1 

where P\j is a polynomial of degree /V/，and € ； v/ = s j- 

Now choose the argument of a so that —Xa is positive, and let | 入 | tend to infinity. 
Since a polynomial grows more slowly than any exponential, we deduce from (26) 
that |of| < Since e,\j tends to zero as M tends to infinity，it follows that a = 0. 
Using lemma 10. trT = —a = 0; this completes the proof of Lidskii’s lemma，and 
thereby of the trace formula. 二 

The proof presented above for the trace formula is due to Gohberg and Krein. 
Lemmas 7 and 8 have been derived by Hermann Weyl. Lidskii’s proof relied on the 
Hadamard factorization theorem for entire functions of exponential type. 

Lidskii defined trace class by forming the linear span with complex coefficients 
of self-adjoint trace class operators. In Dunford-Schwartz trace class operators are 
defined as the product of two Hilbert-Schmidt operators; see section 30.8. The trace 
formula appears in Dunford-Schwartz, but there is no reference to Lidskii. Under 
questioning, Jack Schwartz admitted that he discovered and proved the trace formula 
independently. 


30.4 THE DETERMINANT 

In this section we sketch the definition of the determinant of operators of the form 
I + T, T of trace class, and its fundamental properties. A full discussion is given in 
Gohberg, Goldberg，and Kaashoek. 

For degenerate operators. G, those with finite-dimensional range, the definition is 
taken from linear algebra. Let G act on a Hilbert space H, and K a finite-dimensional 
subspace of H that contains the range of G. With respect to any orthonormal basis of 

K,l + G can be expressed as a matrix; the determinant of this matrix is independent 

of the choice 6flFTe (irthcinoiffi 谷 rbSIiiTSt—df the subspace - "^.' It is defined as the 

determinant of the operator I 十 G. The determinant has the usual properties: 

det (I -4- G)(I + F) = det (I + G) det (I + F), (27a) 

. . . . det (I + G) = fl(l + 〜■): . (27b) 

l 

where 入 are the eigenvalues of G acting on K. including multiplicity. For different 
choices of K vve may get a different number of eigenvalues that are zero; clearly that 

doesn’t change the right side of (27b .)， .. 

Every trace class operator T can be approximated in trace norm by degenerate 
operators; for instance, take the polar decomposition of T = UA, and approximate 



















The main result about determinants as defined above is that the two properties 
listed in (27) are retained. For proofs consult GGK. 

In chapter 24 we presented Fredholm’s theory, based on the notion of the deter- 

FT K, K a onB-dimensionai-integral-operator wiA-ceTi -- 

tinuous kernel. As we will show in section 30.6, not all operator K of this form are 
of trace clkss; so the notion of determinant can be extended beyond the one sketched 
in this section. 


30.5 EXAMPLES AND COUNTEREXAMPLES 
OF TRACE CLASS OPERATORS 


In this section we will study one-dimensional integral operators K of the form 


(Ku)(s) = / K(s,tMt)dt, 


acting on the Hilbert space H = L 2 [0,3]. 

Many—one might say almost all 一 - bounded operators that interest us are integral 
operators in one or several dimensions with kernels that may have singularities. In 
this section we will treat mostly kernels that are continuous functions. Recall that in 





(Kw, u) > 0 for all u in H, 


Then the kernel K can be expanded in a uniformly convergent series 

K(.s,t) = Y / K j e j (s)ej(t), (30) 

where k j and ej are the eigenvalues and normalized eigenfunctions ofK. 

Proof. The key fact is the elementary observation that the kernel of a positive 
integral operator is nonnegative on the diagonal. To see this, suppose, on the contrary, 
that for some r, K(r t r) were negative; then K(s, t) would be negative for s, t close 
enough to r, and then 

(Kit, u) = JJ K(s, t) u{t) u(s) ds dt 

would surely be negative for all functions w that are nonnegative and whose support 
lies close enough to r. 

Define the degenerate kernel K^/ as the partial sum of the series on the right in 
(30): 一 

N . 

K n {s, t) = ^Kjej(s) ej(t), 

1 

and denote by K,\ the integral operator with kerjnel AT//. Clearly, the difference K — 
K/v is a positive operator, for its eigenvectors are ej, and its eigenvalues Kj j > N, 







n.usinsoa 


of functions, by Dini’s theorem convergence is uniform for all s in [0, ,j. w ~， 

we can. by the Schwarz inequality, estimate the remainder of the series on the right 
in (30) and prove its convergence uniformly for all s and t. 

Call this limit Kqo ： we claim that Koo = K. To see why，denote by Koc the 
integral operator whose kernel is K^. From the definition of as the right side of 
(30)，we see that ej is an eigenfunction of K^. with eigenvalues kj. Thus K and K^：. 
•act the same way on all ej ，and therefore on all their linear combinations. Since both 
■Rtmd-K^ma]H : *metlcmfj'©p£hegGnal to all ey into zero, it follows that Ku = K^c-w 
for all functions u. But then K and Koc have the same kernel. □ 

Exercise 6. Show that an integral operator whose kernel is continuous and 一 0 is 

7^ 0* 

Set in equation (30) s = t and integrate; we get 


K(s. s) ds = 


(32) 


Since the eigenvalues of a symmetric positive operator are its singular values, we 
conclude 

Corollary 11 A. An integral operator that satisfies the hypotheses of Mercer’s theo¬ 
rem is of trace class. 


Corollary 11B. The trace of an integral operator that satisfies the hypotheses of 
Mercer's theorem equals the integral of its kernel along the diagonal. 


Formula (32) holds much more generally: 

Theorem 12. Let K be an iniegral operator of form (28), of trace class, with a con- 
nnuous kernel Then the trace of K equals the integral of its kernel along the diago¬ 
nal. 


Proof. We take first the case that the kernel is not merely a continuous function 
but a smooth one. Then the kenie 】 can be expanded as a uniformly convergent series 
of ， say，Lagrange polynomials /„: 

K(s.n = fj(s 、 

where the coefficients kj Jn are given by the usual formula for orthonormal expan¬ 
sions: 


JJ K(s.t) fj(s) f m (t)dsdt. 


We use now definition (9) of trace with the preceding orthonormal basis /„: 
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(K/ ;l ./„) = J K(s. t) f„U) dt^j f n (s) ds = k IIM 

according to the formula above for the coefficients kj jn . Summing with respect to n 
gives 

tr K = 

On the other hand, setting s f in the series for K gives 
K(s^s) = Y^kj， m fj(s)f m (s). 




, . (Lu, u) 

入 "=min max - - r~* 

Sn-l “ 丄 *5 " 一 1 (w ， “） 


It follows that for any given subspace 5 /f -i of dimension n ~ 1 



(L«, u) 

(33) 


X n < max — - . 

一 ll 乂 S, 卜 2 ("•«) 

We choose S„_i 

to consist of all polynomials of degree < /z - 

1. Then for uiS n -\. 

(Lm. u) = jj 

L(s, t) u(s) u(r) dsclt = [[ [L(.y, r) - P n (sJ)]u(s) u(t)clsclt. 

where P n is any 

. . 

function of form 

/?~2 

Pn(s,t) = Y^Clj(S)t j +bj[t)s j • 

0 

. 一 …“ ... (34J... 

According to results in approximation theory，every smooth fi 
well approximated by such functions in the L 2 -norm: 

mction L(s, t) can be 










EXAMPLES AND COUNTEREXAMPLES OF TRACE CLASS OPERATORS ^ 

We showed there that V is a compact operator mapping C[0 ? 1] into C[0, l]. It i 
equally true that V is a compact mapping of L 2 [0,1] into L 2 [0, 1]. 

Exercise 8. Show that V maps the unit ball in L 2 [0, 1] into a compact subset o 
C[0,1]. 

Note that V is an integral operator, with the discontinuous kernel 



ias no eigenruncuons in ulu, ij. v nid]jb 
V has no eigenfunctions in L 2 [0, 1] either. We 





singular values oi v cinu 
is a differential operator.) 


We remind the reader that we showed in chapter 24, theorem 5, that an integral 
perator whose kernel is Holder continuous, with Holder exponent > 5, satisfies the 
itegral form of the trace formula. 

We close this section by asking—and answering — the following question: Given 
a integral operator K, how can we decide if it has any nonzero eigenvalues? If 
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trIC /r =H. (35) 

It follows from inequality (20) that Dk/1 < |K| tr . We build the entire analytic 
function 

F(z) = J2( eKjZ - ] )- (36) 

"Since \e w -T|'< V|iiV| for |u，| < 1. the'series (36) converges for all The Taylor 
coefficients of F at s = 0 can be computed by differentiating (36) termwise: 

-:- . 0 . (ft) jb. . ... 

It follows from the assumption that xrK n = 0, and so by (35) that all the Taylor 
coefficients of F are zero, and therefore F(z) itself is zero for all 二 We claim that 
then all Kj are zero. Suppose not; let k\, .Kj be those of largest absolute value. 
Choose z so that k\z is real and positive, and let \z\ oo. Clearly, the first term in 
(36) dominates all others, and F(z) — m where m is the multiplicity of k\ . 
This contradicts F(z) = 0. □ 


30.6 THE POISSON SUMMATION FOR3V1ULA 

It this section we study integral operators of convolution form. Let / be any contin¬ 
uous function on the circle 5 1 . T/- = T is defined as convolution with /: 

(Tu)(s)= f f(s-t)u(l)dt/2yr. (37) 

-： Js 1 ' 

As we saw in chapter 29, the eigenfunctions of T are the exponentials e n (t) = e l nt : 

Te„ = J f{s-i)e in, ^/2 tt = J f(r)e~ inr clr/27ce ins =a„e inx . (38) 

So the eigenvalues are the Fourier coefficients of /• 

The kernel of the integral operator T equals / (0) at every point on the diagonal; 
so if T were a trace class operator. trT = /(0) = by the trace formula. This 
is the same as saying that the Fourier series of / at = 0 converges to /(0)， true 
for sufficiently smooth functions but not for all continuous functions; see chapter 11. 
section 11.2. This shows that noi all integral operators with a continuous kernel ore 
of trace class. 

Consider functions g defined on the whole real line, smooth and decreasing 
rapidly as |.v| tends to infinity. Define now the operator T as convolution: 

(Tu)ls) = f g(s -t)uU)dt/l 7 T. 


(39) 
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regarded as a mapping of L 2 (S ! ) into L 2 (5')• We can put this in the form (37) by 
chopping up K as the union of intervals [2nm ： 2n(m + D] of length 2jt: 

(TiOU) = J ~ 1 + - 7![m ) u ^')dt/2n. (35 ’） 

The eigenvalues ofT are given by formula (38)，which can be rewritten as follows: 

a„ = j X^ g{, ' + 2nm) e ~ ，，ir dr ^ 2n ~ j 7 g0 ) ^ d， ^ 2n = 豆⑻ / 27r ， 

where g is the Fourier transform of g. The kernel of the integral operator T equals 
[ g(2nm) at every point on the diagonal. Therefore the trace formula asserts that 

y^g(27rm) = InY^gOi)- 

This is the classical Poisson summation formula. 

The scope of this argument can be enlarged beyond this simple case. The additive 
sroup of the reals can be replaced by other, not necessarily commutative, groups, 
and the group of integer multiples of 2?r by other discrete subgroups. The celebrated 
Selberg°trace formula is a far-reaching generalization of the Poisson summation For¬ 
mula. 


30.7 HOWTO EXPRESS THE INDEX OF AN OPERATOR AS A 
DIFFERENCE OF TRACES 

We recall from chapter 27 the notion of the index of a bounded mapping F of u 
Banach space U into another Banach space V. Denote by N the nullspace, by R the 
range of F; assume that the dimension of N and the codimension of R are both finite. 

Their difference is defined as the index of F: 

ind F = dim W - codim R. (40) 

According to theorem 1 of chapter 27, an operator ¥ : U -* V has aoindex. iff it has - 

a pseudoinverse G : V U. such that 

GF = I - T, FG = I - S. (41) 

"where 7 ■ 0 U andS : V ^ V are compact maps. In this section we study the 
case when U and V are Hilbert spS 品 「 and.T anH'Slife not merely compact but are 
of trace class: 

Theorem 15. Let U and V be a pair of Hilbert spaces, F : U V ancl G : V -t U 
bounded operators that are pseucloinverses of each other in the sense of (41), where 
j : [j U an d S : V V are trace class operators. Then 

indF = trT — trS. (42) 


■Cj, 
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Proof. Multiply the first relation in (41) by F on the left, the second relation by F 
o n the right, and subtract.one from the other: 

FT = SF. (43) 

Decompose orthogonally U and V as follows:' — 

- ― _ 


Define P as the orthogorM projection of U onto Z. Since the orthogonal complement 
of Z is the nullspace N of F, it follows that FP = F. Setting this into (43) gives 


FPT = SF. (44) 

Note that PT maps Z — Z，S maps R R, and F is an invertible map of Z 
We claim that 

trPT/Z = trS//?. (45) 

where trPT/Z means the trace of PT restricted to the invariant subspace Z, and 
so on. 

Proof. Choose any unitary map Mof R onto Z. Multiply (44) on the left by M: 
MFPT = MSF = MSM 一】 MF. 

Multiply this by (MF)" 1 on the right: 

(MDCPDCMF)- 1 .= (MSM- ] ). 

All operator in parentheses map Z — Z. So by the commutative property of trace, 
theorem 4 (iv), ， 

trPT/Z = trMSM~V2 = trS/R. □ 

We express now the trace of T over C/ in terms of the trace of PT over Z. Build an 
orthonornial basis for U consisting of a basis [iij] in N and a basis {zy} in Z. Then 

trT = [(T~ ， nj) + 

Since Pz ； = Zj, we can rewrite the second sum on the right as 
J^(Tzj,Pzj) = TCPTzj. zj) = trPT/Z. 
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On the other hand, F = 0 on iV, so it follows from the first relation of (41) that T = I 
on N. Therefore 

〜 •) = Y^Oij.nj) = dim N. 

Putting together the last three relations gives 

trT = dim/V+trPT/Z. (46) 

Similarly, we express the trace of S over V in terms of the trace of 5 over R. We 
build an orthonormal base for V consisting of a basis (u;；) for W and a basis r ； 
for R. Then 

trS = Wj) + y^(Sptrj). 

We identify the second sum on the right as 

X]( S/ 7 ， r j) = trS / 尺 . 

It follows from the second relation in (41) that the range of I - S lies in R, and 
therefore is orthogonal to W. In particular, ((I - S) wj, wj) = 0, so 

J2{Swj,Wj) = Yl (w j' w j'> = dim w = codim 

Putting together the last three relations gives 

trS = codim 尺 + trS/i?. ( 招’） 

Subtract (460 from (46); since we have shown in (45) that trPT/Z = tvS/R, we 
obtain the trace formula (42) for the index. D 

Even when G is too crude a pseudoinverse for F to make S and T of trace class, it 
could happen that for some positive integerand are trace class: 

Corollary 15 ; . The spaces U, V, the operator F, G are as in theorem 15 t and S n 
• and V x Ifftrace^lassTirsome positive integer. Then 

indF = trT， 1 一 trS”.. . • (47) 

Proof. We replace the pseudoinverse G by G„ = (E2 一 1 T^)G. Then 

G„F= 涅十 F = (E Ti ) (I - ” = I - T' ⑽) 


0. 
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where we have used the first relation in (41). Using both relations in (41) we deduce 
that TG = GS. Using this relation repeatedly, we get_ 


(? 


FG„ =F G = FG =(I-S) 




-S' (48 , ) 


Now we apply theorem 15. * . -. - - . . -C -- —— 


If formula (47) holds for one value ofit holds for all larger ones. This seems 
peculiar, until we do 

of T is an eigenvalue of S. with the same multiplicity. 


Theorem 15 and its corollary can be very useful in calculating the index of oper¬ 
ators: see Gilkey. 


30.8 THE HILBERT-SCHMIDT CLASS 

The last exercise of this chapter summarizes the main properties of the Hilbert- 
Schmidt (HS) class of operator in Hilbert space. 

Exercise 11. A bounded linear operator K mapping a Hilbert space H into itself 
belongs to the HS class if for some orthonormal basis [ej] of H, 

ElIKQll 2 <oc.. (49) 

(a) Show that if K satisfies (49) for one orthonormal basis, then it satisfies (49) 
for every orthonormal basis, and the sura in (49) is independent of the basis. 
The square root of this sum is called the //S-norrn, denoted as ||K||// 5 . 

(b) Show that ||K|| < \\K\\ hs . 

(c) Show that if K is HS, so is its adjoint K*. and ||K *||"5 = ||K ||"。 

(d) Show that the HS operators form a complete normed space in the //5-norm. 

(e) Show that if K is HS and B is any bounded operator, then BK and KB are 
^5, and \\BK\\hs, IIKBIU 5 are both < ||B|| \\K\\ hs . 

CfJ Show that K is HS iff < 00 . 

(g) Show that every H S operator is compact. • 

fhj Show that every trace class operator is HS. 

(i) Show that the product of two H 5 operators K and that H is in trace class, and 
||KH|| lr <||K|| ws l_ s . 

(j) Show that every trace class operator can be written as a product of two HS 
operators. 
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30.9 DETERMINANT AND TRACE FOR OPERATOR 
IN BANACH SPACES 


The earliest developments of a determinant theory for operators in a Banach space 
are due to Lezanski in 1953, Grothedieck in 1956, and Sikorski in 1961. The ear¬ 
liest derivation of the trace formula for a class of operators in a Banach space is 
due to Konig; a systematic approach has been developed by Pietsch in a*series of 
publications culminating in his monograph. 

Yet another, still broader, systematic approach is presented in the recent excellent 
monograph of Gohberg ， Goldberg, and Krupnik. 
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SPECTRAL THEORY OF 
SYMMETRIC, NORMAL, 
AND UNITARY OPERATORS 


In this chapter we study operators M that map a complex Hilbert space H into itself, 
that are bounded, and that are symmetric in the sense that M* = M. According to 
the definition of adjoint this means that for all x and y in H 

(Mx,y) = (x,My). _ (1) 

Exercise L Show that 

(a) The inverse of an invertible symmetric operator is symmetric. 

(b) The product of commuting symmetric operators is symmetric. 

(c) The set of symmetric operators is closed in the weak topology for operators. 

In chapter 28 we saw that every compact symmetric operator has a complete set 
of orthonormal eigenvectors. In this chapter we generalize this result to include sym¬ 
metric operators that are bounded but not compact. To show how to do this, we 
reformulate the spectral resolution of compact operators. 

Denote by [e n ) the eigenvectors of the compact operator A. Every vector x in the 
Hilbert space H can be expanded in a Fourier series, and so can Ax: 

X = AjC = 6/j. ( 2 ) 

Denote by E n projection onto the eigenspace with eigenvalue / W7 . Then (2) can be 
rewritten as 


jc = XE„.v. Aa* = Za,iE/|.y. (2 ; ) 

We rewrite the sums in (2) as integrals by introducing the projection-valued measure 
E(S) as follows: for any Borel set S of E. 
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E(5)= ^E (! . 


The support of the measure E is the spectrum of A. Using the measure E defined 




< ■ ■■ I ' fl'T ' ■ 


arbitrary bounded symmetric operators M. The projection-valued measure E that 
enters the resolution is, of course, no longer a pure point measure. 

The following result is as basic as is simple: 

Theorem 1. ForB bounded symmetric ， (B.r, v) is a bounded, skew symmetric form 、 
linear in x，skew linear y. 

Conversely, let b{x,y) be a skew symmetric fonn 


linear in x t and bounded: 


Then b can be expressed as 


b{y y x) = b(x,y% 


v)l <c||.r|||iy||. 


b(x y y) = (.x. By) 

where B is a bounded symmetric operator, and 


Proof. The direct part is a consequence of the symmetry of B, 
equality, and the boundedness of B. To deduce the converse, we 
b{x, y) as a bounded linear functional of .r, with bound c|| 7 1|. Accort 
Frechet representation theorem, we can write this functional as a sc: 



b{x. v) = (x. w). 


ermined by >*; setting x-w\n (5) shows that ||tu|| < ci|.y||. 
ides of ( 8 ) depend skew linearly on y and w, respectively, it 
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This proves (6) and (7). The symmetry of B is a consequence of skew symmetry ofb: 

(x, B )，） = h(.x. y) = biyTx) = (v. Bx) = (Bx, v). 

Note that (x. B.v) = (Ba*, x ) is real for all .v. 口 

31.1 THE SPECTRUM OF SYMMETRIC OPERATORS 


Theorem 2. The spectrum of a bounded, symmeiric operator M on a Hilberi space 
is real. 


Proof. We have to show that every nonre^l a = a +/ 卢， ^6^0. lies in the resolvent 
set. Define the function B as follows: 

B(x. y) = (a*. (M — 入 )•、，)• 

This function has all three properties listed as the hypotheses of theorem 6 of chap¬ 
ter 6 : 

(i) B is linear in x, skew linear in v. 

(ii) B is bounded; for by the Schwarz inequality 

\B(x.y)\ < |U1||(M~ ； 0v!l < WM(||M|| + W). 

(iii) B(y. y) is bounded from below, 

B()\ y) = (y, (M - X)y) = (v, Mv) - cx(y\ y) ~ //3(y. v). 

The first two terms on the right are real，the third imaginary; so 
|B(v.v)l> |/mS(V.Tj| = |/!|||y!| 2 . 

We appeal now to the Lax-Milgram lemma，theorem 6 in chapter 6 , which asserts 
that every linear function C(x) can be represented as B(x. y) for some y. uniquely 
determined by £. Take = Hi): there is a uniquely determined y such that 
B(x.y) = (.v.z) for all x. Using the definition of 5 as (.v, (M — X)y), we conclude 
that (M—A) v = Since c is arbitrary, this shows that (M — 入 ） is invertible. Therefore 
a belongs to the resolvent set of M. □ 


n the context of Gelfand’s 
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a pair of symmetric operators. Then 

dist (a(M). a(N)) < ||M-N||, (12) 

where the distance of the two closed point-sets a(M) and a(N) is defined as the 
larger of the rwO quantities 

，--:-二.心 --max：. 一 min |v - max min |v — fji\. (13) 

_ v in a(N) \i in o-(M) /i in <r(M) v in cr(N) 

Proof. Denote ||M — N|| by d. Suppose that one of the quantities in (13), say the 
first, is > d. Then for some v in a(N), 

……— : —.' (f4)； ' 

Such a v belongs to the resolvent set of M, and so M — vl is invertible. According to 
the spectral mapping theorem 

tr((M — ul 厂 】）=(cr(M) — v) -1 . 

It follows from this and (14) that 

|<j(M - vl)-<d (15) 

Since (M — vl) 一 1 is a symmetric operator，its spectral radius equals, according to 
theorem 3, its norm. So it follows from (15) that ||(M — ul)_ 3 1| < d 一 J . 



I|K|!<||(M- vI)、||N - _ < d 一 = h 


It follows that the second factor I + K is invertible by the geometric series. The first 
factor (M — pI), too, is invertible, so is their product N — vl. But this contradicts v 
being in the spectrum of N. □ 


31.2 FUNCTIONAL CALCULUS FOR SYMMETRIC OPERATORS 


i— 




gW = ci n X n + . • • + flo. 


Then, if M is symmetric, so is 

々 (M) = a n M n + . ■ • + 卯 1. 

According to the spectral mapping theorem, theorem 5 of chapter 17, 
cr(q(M)) =q(cx(M)). 


Combining this with formula (9) of theorem 3, we deduce that 


\\q(M)\\ = max |^(a)|. ( 

Xscr(M) 

Let f(X) be any continuous real-valued function on a(M), the spectrum of M. 
can approximate f(X) uniformly by polynomials on cr(M), for we can extend / 
tinuously to an interval containing a(M). Now according to the Weierstrass app 
imation theorem, / can be approximated uniformly on this interval by polynom 
So there is a sequence {q n }, such that 


lim max 


1/(入）1(入)1=0. 


It follows that is a Cauchy sequence: 

iim max ( 入 ）一 ( 入 )I = 0. 

m ， rt—oo 


It follows then from (160 that 


lim ||^(M)-^(M)||=0. 

mm—oo 

Since the bounded operators are complete, lim„-^ 2 o^/i(M) exists. We denote 
limit by /(M). Its properties are summarized in 


Theorem 6. f /(M) is an isometric isomorphism: 


(i) (/ + g)(M) = /(M) + 尽 (M), (fg)(M) = 

(H) li/(M)|| = max a(M ) |/( 入 )|. 

(iii) f(M) is symmetric anda(f(M)) = /(cr(M)). 
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: l/wl 


lim max I 分 , ，（入 )|. 
/2-»-oo a (M) 


These two together, combined with (16'), give (ii). 

(iii) The adjoint of a limit of maps is the limit of their adjoints. Since each p(M) 
is symmetric, it follows that so is f(M). Since /(M) is the uniform limit of q n (M), 
it follows from 02) of theorem 5 that cr(/(M)) is the limit of a(g n (M)). Thus (iii) 
follows from (16). This completes the proof of theorem 6. □ 

Recall from chapter 3 8 that a symmetric operator M is called positive if 

-- - - - -fo r alt A 7 in H. . .. 

Theorem 7. A bounded symmetric operator is positive if and only if its spectrum 
contains only nonnegative numbers: 


a(M)>0.. 

Proof, (i) Suppose that cr(M) > 0. The function f(X) = y/k is continuous for 
入 > o, which contains a(M). The functional calculus defines VM = N as a sym¬ 
metric operator, which satisfies = M. Therefore 


(Mr,x) = (N 2 x, x) =： (Na% Nx) > 0. 


\1 is positive. 




SPECTRAL RESOLUTION OF SYMMETRIC OPERATORS 361 

the square root of any positive symmetric operator, we can remove compactness and 
assert: 

Eveiy bounded operator in Hilbert space has a polar decomposition. 

31.3 SPECTRAL RESOLUTION OF SYMMETRIC OPERATORS 

According to the Riesz representation theorem, every bounded linear functional i 
on the space of continuous functions / defined on the compact space cr(M) can be 
described as the integral of / with respect to a uniquely determined measure m(5 ； 
defined on the Borel subsets S of a(M), whose total variation is finite. We use now 
thp fnnrtinnn! ralr.nl 115; rlescrihed in theorem 6 to construct the functionals 




since both measures represent 









362 SPECTRAL THEORY OF SYMMETRIC, NORMAL, AND UNITARY OPERATORS 
conclude that 

..I4y(/)l = |(/(M)a-oOI < ll/(M)||||A-||||v|| = |/| max ||A-||||.v||. 

This shows that |4 V | < WIILvIl，so (iii) follows. 

(iv) According to part (iii) of theorem 6, for / real the spectrum of /(M) is 
.?° f° r positive functions /, / (M) is a symmetric operator whose spec- 
trum is positiv e. Therefore according to theorem 7, /(M) is a positive operator. This 
"IhcTws that thelinear functionai 4., (/) = (/(M)x, x) is positive; but then so is the 
measure m XtX representing it. □ 

一 - Borel-subset of ^(M) r is-a bounded r skew- 
symmetric sesquilinear functional of x and v. We conclude from theorem 1 that for 
each S there is a bounded symmetric operator E(5) such that 

^.y,vW = (E(5)a*,v). (20) 

This family of operators has the following properties: 

Theorems Let E(S) be the family of maps defined by (20)，where m x , x (S) is de¬ 
fined by (18). 


(i) E*(5) = E(S). 一 
Hi) i|E(5)|| < 1. 

(iii) E(0) = 0, E(a(M)) = I. 

(M //5nr = 0, E(s un = E(S) + E(r). 

(v) Each E(5) commutes with M. 

(vi) E(SDT) = E(5)E(D. 

Each E(5) is an orthogonal projection. If S and T are disjoint，the range of 
E(5) and E(T) are orthogonal. 

('蝴 All orthogonal projections E(S), E(T) commute. 

Proof. (i)is part of theorem 1. Part (ii) follows from part (iii) of theorem 8. 

(iii) Since m A - t V (0) = 0, it follows from (18) that E(S) = 0. On the other hand, 
setting f(X) = 1, /(M) = I in (18) gives for all .v and 

(.v.y)= / dw XtY = (E(ct(M))a% j), 

Jct(M) 

which means thatE(cr(M)) = I. 

Part (iv) follows from the additivity of the measure m Xt y. To show (v), we note 
that since M commutes with /(Mj, and is symmetric, 


(/(M)Mxo’）= (M/(M)jc, y) = (/fM)x, My). 
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The functional of / on the left is represented by the measure mjvtc. v ； the functional 
on the right by the measure m x% Mv- Since the functionals are the same，so are the 
measures: 

Setting this into (20), and using once more the symmetry of M, gives 

(E(5)lVL^y) = (E(5).r, My) = (ME(S)x } y). 

Since this holds for all x and y, E(5)M = ME(5), as claimed in (v). 

We postpone the proof of (vi) to section 31.5. 

(vii) Setting S = T in (vi) shows that E(S) = E 2 (5), namely that E(5) is an idem- 
potent. The geometric expression of this algebraic fact is that E(5) is a projection. 
Since, by part (i), E is symmetric, E(5) is an orthogonal projection. It follows from 
(iii) and (vi) that if S and T are disjoint, the range of E(5) and E(T) are orthogonal. 
Part (viii) results from interchanging S and T in (vi). □ 

The family of operators E(S) is an orthogonal projection-valued measure. 

Exercise 3. (a) Show that E(S) is countably additive in the strong topology. (Hint: 
Use the orthogonality of the ranges of E(5) and E(7) when S and T are disjoint) 
(b) Show that E(5) is not countably additive in the norm topology. 



Theorem H is a Hilbert space M •• H — H a bounded, linear symmetric op¬ 
erator Then there is a uniquely determined orthogonal projection-valued measure E 
on the spectrum ofM such that E(S fl 7 1 ) = E(5)E(r) and 

/(M) = f f(X)dE (21) 

Ja(M) 

for all continuous functions f ona(M). The integral exists in the norm topology. 

Proof, The meaning of (18) and (20) is that (21) holds in the weak topology. To 
show that it holds in the norm topology, it suffices to show that the Riemann-Stieltjes 
1 兩西 riti 晒 rd 涵贼 TnllTat- topology.Tffis can be* done in'tfie siffidaFr 
fashion, combined with the estimate 

< max |a y I, 


where Ulj = a(M) is a decomposition of a(M) into a finite number of disjoint 
pieces Ij. The estimate follows from the orthogonality of the ranges of E(/y). The 
uniqueness of E(5) follows from the uniqueness of the scalar measures (18). □ 
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Take- / = 1, and f CAj s X ： 

1= [ dE. M = f kdE: 

. . . •• Jo(M) 

(22) is called the spectral resolution of M. 


-—— J ： U 一 ABSQLUT-EL\U5 QN - m 勝 US 、 棚 GUL - A I^ANB~PQiNT-SPEemA 

We give now an important further refinement of the spectral resolution. According 
to the Lebesgue decomposition Theorem for measures, any measure on R can be de- 
: 二 : 而历分 0 每 d as41ie sum of d, point measure, supported on a denumerable set, a singular 

measure, supported on. a set of Lebesgue measure zero, and an absoluiely conrimtous 
measure with respect to Lebesgue measure. 

We apply this to the measures m Xt y = (E.v. v): 

m X \y = + /77 .\\v + • （ 23) 

From the uniqueness of the Lebesgue decomposition we conclude that all three mea¬ 
sures on the right in (23) depend linearly on x. skew linearly on v. For any 5 these 
sesquilinear functionals can be represented as 

= (E (p) (S)x,y), m^y(S) = (E (s) .\\ y) and == (E (rl .\*. v). 

These bounded, symmetric operators E ip \ E {s) . E lc) have all properties listed in 
theorems 9; that is. each family is a projection-valued measure, each orthogonal to 
the other. 

Denote the ranges of E^fcrfM)), E ⑴ （ cr(M>)，and E fc ‘) （ a(M)) as H is \ 
and they are called the point, singular^ and absolutely conrimtous subspace of 
H with respect to the operatorM. Clearly, 


31.5 THE SPECTRAL REPRESENTATION 
OF SYMMETRIC OPERATORS 

Spectra] representation is an infinite dimensional analogue of the diagonal form of 
symmetric matrices. 

Theorem 10. For any vector x in the Hilbert space H，and any coniinuous func¬ 
tion /, 


( 22 ) 


||/(M).v|! 2 = I \fi}.)\ 2 dm x , x . 


(24) 
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Theorem 11. There exists a family {Kj} of closed subspaces ofH such that 


(i) The K j are pairv\dse orthogonak and they span H: ■- . 

// = 心 ㊉ 尤 2 ㊉•… (25) 

(ii) Each .Kj.is invariantuiider .M and is^pectraUyj'epr£sentedMsJLzliiij.). .. 

Such a collection of [Kj] is called a spectral representation of M on H• 

_easj.jo. derive from a spectral representation of M its spectral resolution. 

For any measurable set S, define on Hi 色 Zij 涵 as 一 foIfoWsi in the 

representation of K j as L 2 (mj\ E(S) is multiplication by c s (X), the characteristic 
function of S, defined as 


⑽） = 


0 


if 入 in S 
otherwise. 


Exercise 5. Verify that E(5) as defined above is a spectral resolution of M, namely 
that the family {E(S)} is a projection-valued measure that has all the properties listed 
in theorems 9 and 9' - - ———. . . 


Although a given operator has many spectral representations, the spectral resolu¬ 
tion derived from each is the same, as explained in theorem 9\ 

We turn now to the last topic of this section: spectral multiplicity. To simplify 
the discussion, we take the case that the underlying Hilbert space H is separable. It 
follows that in any spectral representation (25) the family {ATj} is denumerable. 

For further simplification we assume that the spectrum of M is absolutely con¬ 
tinuous with respect to Lebesgue measure in the sense explained in section 31.4. By 
the Radon-Nikodym theorem, such a measure is equivalent to Lebesgue measure on a 
subset S of R. Under this assumption the measures my entering the spectral represen¬ 
tation of theorem 11 can be taken to be Lebesgue measure over sets Sj that support 
the measure, where the support of m is the union of points A that have the property 
that the restriction of m to any open interval containing 入 has positive measure. 

The following are easily verified: if [mj] is a denumerable collection of measures, 
the support of Em j contains the union of the supports of/ 77 y. 

Two measures are singular with respect to each other if the intersection of their 
support has Lebesgue measure zero. 

Lemma 12. Take any two spectral representations (25) and (25’）of a bounded, 
symmetric operator M, with absolutely continuous spectrum, acting on a separable 
Hilbert space: 


H = K[@K2@..., K'j ^ L 2 (S'j). 


(250 
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We claim that up to sets of measure zero 

USj =US f j. 

Proof. Take any index fc; choose x to be that vector in K'k that is represented by 
the function s 1 in Sk- For any continuous function /， 

(f(M)x y x)= f /( 雜 . (26) 

JSk 

Denote by x ； j the projection of x onto K’j, and denote by gj function rep¬ 
resenting Xj in L 2 (Sj). Then x = "Exj, and 

(f(M)x,x) = S(/(M)«) = f(k)\ gj \ 2 dX = f f(k)\g\ 2 dX, (26') 

where \g \ 2 = S|g;| 2 . 

Both (26) and (260 represent the same bounded linear functional. Therefore the 
representing measures must be equal: 

c Sk dk = \g 2 \dX. 

The support of the measure on the left is Sk, and the support of the measure on the 
right is contained in US’】. Therefore Sk C US’】. Reversing the role of (25) and (25 , ) ? 
we obtain lemma 12. □ 

Definition. The spectral multiplicity of a point 入 in a spectral representation (25) 
T,c Sj (X), is the number of sets Sj to which a belongs; Sj is the support of the yth 
measure my in the representation (25). The spectral multiplicity of X can be zero, 
any natural number, or oo. 

A spectral representation is far from unique, since its construction contains many 
- arbitrary choices. So it is far from clear that the spectral multiplicity function has 
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Splitting. S uppose that a subspace K of H is represented spectrally by L 2 (m), 777 
some measure on M. Splitas the sum m = 'Em j of measures that are pairwise 
singular with respect to each other. Each L 2 (mj) represents some closed subspace 
K .i of their direct sum A ：】 ㊉ ㊉ • • • is a spectral decomposition of K. 

Combination. This is the reversal of splitting. Suppose that {A ： y) is a collection 
of pairwise orthogonal subspaces of H. each represented spectrally by L~(wj )： 
Suppose that the m j are pairwise singular with respect to each other. Then K = 
. 尺 1 ㊉ 尺 2 ㊉ •. • is represented spectrally by L 2 (m), m = Em/. 

We will use splitting and combination to rearrange any spectral representation 

H =： j(\ ® K 2 ㊉ …， Kj 分 L^(Sj) 

into a standard form, as follows: Decompose each set Sj as 
Sj = U5j', 

where Sj is that subset of 5 ； which belongs to exactly k of the sets Sj. Define the set 
M] as 

M \ =U5j. (27) 

Clearly, M j is the set of points that belong to exactly one set Sj, namely points of 
spectra] multiplicity one^in the spectral representation under discussion. Similarly 
we define as 一 

.M k =US k ?, (2/) 

Clearly, M k is the set of points of multiplicity k. Moc is the set of points of infinite 
multiplicity. It follows from the construction that 

USj =UM k UMoo- (28) 

By splitting each Sj and recombining them into sets Mk as described above, we can 

3p]it and recombine the spectral representations Kj L 2 (Sj) into spectral repre¬ 
sentations by L 2 (M 々 ），/: = 1 ， 2 •… ， 00 . There is exactly one subspace represented 
by L-(M \), call it H\. There are two orthogonal subspaces represented by L 2 (M 2 ), 
whose direct sum we cal] Ho^. and so on. The subspaces are orthogonal to each 
other; so we obtain the spectral decomposition 

// = //! ® // 2 © … ® //oc ， (29) 

where H k is represented spectrally by 々 copies of L 2 (M/ ; ). We call (29) the standard 
form of the spectra] representation (25). 

We claim that any other spectral representation (25。has the same standard form. 
Here is the proof: Denote the standard spectra] representation obtained from (25') as 

H = H; ㊉ ㊉..•㊉ Hl^.. 


(29，) 













370 SPECTRAL THEORY OF SYMMETRIC, NORMAL, AND UNITARY OPERATORS 

Combining this with the disjointness of M\ and M^k > 1, we conclude that Mi is 

contained in My Reversing the role of the two spectralx^resentations^J^-conclaide - 

that M[ is contained in M\ ; therefore M\ = M^- 
To show that H\ = H[, we argue as in the proof of lemma 12. We leave it to the 
diligent reader to show that for all k, Mk = Mj. and Hk. = Hj.. □ 

Exercise 6. Let (25) be a spectral representation for M, -o- L 2 (mj)ySj -the-^up-- 
port of nij. Show that the closure of the union of Sj is the spectrufli of-M. --- 

Exercise 7. Give an example to show that the spectrum of M may contain a set of 
positive Lebesgue measure whose multiplicity is zero. 


Definition. Two bounded, symmetric operators M and N acting on a Hilbert space 
H are called unitarily equivalent if there is a unitary map U } that is. a one-to-one a 
norm-preserving map of H onto H, that carries M into N: 

N==UMU _1 . 


Theorem 14. Two bounded, symmetric operators M andN whose spectrum is abso¬ 
lutely continuous are unitarily equivalent if and only if they have the same spectral 
multiplicities. 


Proof. IfM and N are unitarily equivalent U carries the standard spectral repre¬ 
sentation of M into a standard spectral representation of N. Conversely, if the spec¬ 
tral multiplicity set Mk and Nk for the two operators M and N are the same，then the 
standard spectral representations for M andN furnish the unitary map U. □ 


31.6 SPECTRAL RESOLUTION OF NORMAL OPERATORS 


A bounded, linear operator N mapping a Hilbert space into itself is called normal if 
it commutes with its adjoint: 


N 才 N = NN *， * (30) 

Clearly, every bounded symmetric operator is normal; as we will see in the next 
section, so is every unitary operator. 

The spectral resolution of normal operators is analogous to that of symmetric 
operators except that the spectrum of a normal operator may well contain complex 
numbers. 

Theorem 15. Let H be a Hilbert space, N : H 一 H a normal operator. Then there 
is an orthogonal projection-valued measure E on the Borel subsets of the spectrum 
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Since every invertible operator commutes with its inverse，it follows from (38) 
that U and U* commute. This shows that every unitary operator U is normal; thus ， 
according to theorem 15, U has a spectral resolution of form (31), only in this case 
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Proof, (i) Every measure is uniquely determined by its Fourier coefficients. It 
follows that since the left side of (43) depends linearly on x, so does the right side. 

(ii) is an immediate consequence of the definition (44) of the measure m x ,y. 

(iii) Take any set S on the unit circle; we claim that m Xi y(S) is a scalar product 
in H. By (i), it is linear in x\ by (ii), it is a skew symmetric function of x and )». 
Since w XiX (S) = m x (S), it is nonnegative. Therefore the Schwarz inequality can be 
applied: 

|m a .. y (S)| < 

We saw earlier that the total measure m x is ||i'|| 2 . Since m x is nonnegative. m x (S) < 

ll^^T^^irvllVaha-sonm^v^ CT 

Note that theorem 17 is a literal analogue of theorem 8. 

Let S be any Borel set on the unit circle. It follows from theorem 17 that m x ， y (S) 
is a skew symmetric function of x and )，，linear in x, and bounded by ||x||||y||. It 
follows then from theorem 1 that it can be represented as 

m x . y (S) = (E(S)x,y) i (45) 

E(5) a bounded, symmetric operator. We claim that this family of operators has the 
same properties as those listed mlheorein 9 . Here, for instance, is a demonstration 
of property (vi): 

(vi)E(5nr) = E(S)E(r). 

Proof. We set (45) into (43), obtaining 

(U n x, y) = J e inB d(Ex, y). (46) 

Replace n by n + k: 

(V' ,+k x.y) = J e ine e ike d(]Ex.yy, , (47) 

on the other hand, using (46) with x replaced by U^a*, we can express the left side of 
(47) as 

J e il,e d(EU k x, y). (48) 

Two measures on the unit circle that have the same Fourier coefficients are identical; 
therefore 


e ike d(Ex, y) = d(EU k x, y). 






rewrite the right side of (49) as {V k x, E(S)y)\ using formula (46)，with k in place of 
n, we find that this is equal to 

J e ike d(Ex, E(S)y). (490 

The measures on the left in (49) and (49’）have the same Fourier coefficients; there¬ 
fore they are identical: 

c s d(Ex,y)= ： d(ExMS)yl 
Integrate both sides over any other Borel set T: 

J c T c s d{Ex, y) = (E(T)x, E(S)y). 

Since c T cs = c S nT^ the left side is (E(5 n T)x, y). Using the symmetry of E(5), 
we find that the right side is (E(S)E(T)x, y). Since they are equal for all x and y, 

E(5nr) = E(5)E(D, 

as asserted in (vi). _ □ 

We leave it to the reader to verify the rest of the properties listed in theorem 9. 
Setting (45) into (43), we obtain 

(U n A-,y) = J e ind d{Ex, y), 

which is the weak version of 

U n = f e i,td dE. (50) 


spectral resolution of unitary operators. 

historical NOTE. The spectral resolution of bounded symmetric operators is due 
to Hilbert. The spectral representation, and theory of spectral multiplicity, is the work 
of Ernest-Hellinger, (1883 - 1950)，a student of Hilbert. He was professor of mathe¬ 
matics at the University of Frankfurt until his dismissal by the Nazis. In the infamous 
anti-Jewish pogrom in 1938, dubbed “Krystallnacht，” he was taken to the dreaded 
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32 

SPECTRAL THEORY OF 
SELF-ADJOINT OPERATORS 


In this chapter we present the spectral theory of unbounded self-adjoint operators. 

We start with the observation, due to Hellinger and Toeplitz, that an operator M 
that is defined everywhere on a Hilbert space H and is its own adjoint, 

(IVLr, y) = (x, My), ⑴ 

is necessarily bounded. 

We show first that M is a closed operator. Suppose that x„ is a convergent se¬ 
quence, .v„ jc, and that converges to some vector u. Setting .r = x„ into (1), 
we get 

(]S/Lx n t y) = (x n . Mv), 

and in the limit we get 

(it,y) = (jc, Mv). 

By (1), the right side equals (Mv, 3 ；). Smce.this holds forjill )，, w =JMv. This proves 
that M is closed. But then, according to the closed graph theorem, theorem 12 of 
chapter 15, M is bounded. 

It follows from this observation that unbounded operators that are their own ad- 
__joints mn hp Hpfinprl_nnl_y_Qn n 卬 hspace-olHiIb£rLspace-HereisJjie_precise_definL-— 
tion, due ..to.. y.oa Neumann :…… . . . •.… 

Definition. Let // be a complex Hilbert space, D a dense subspace of and A a 
linear operator defined on D. The adjoint A* of A is the operator whose domain D " 5 
consists of all vectors v\n H for which there is a vector denoted as A*u in H such 
that 

(. 2 ) 
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{Au, v) = («’A’u) 
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holds for all u in D. Since D is dense, for any given v there can be only one such 
vector A*v. Clearly, D* is a linear subspace of H, and A* is a linear operator on D*. 
A is called self-adjoint if D* = D and A* = A. 


32.1 SPECTRAL RESOLUTION 

The main result to which this chapter is devoted is the spectral resolution of self-_ 

adjoint operators: 


Theorem 1. Let A be a self-adjoint operator in a Hilbert space H; denote the do- 

: - 成 ■ J Fhereris~a^pecmtresottttkmibr~ArthaH^rorthogonatpraj^ction^ - 

valued measure E defined for all Borel measurable subsets ofR, with the following 


properties: 


(i) E(0) = 0, E(R) = I 

(ii) For any pair of measurable sets S and T t E(S H T) = E(5)E(r). 

(iii) For evejy measurable set S, E*(S) = E(5). 

(iv) E commutes with A, that is, for any measurable set 5, E(S) maps the domain 
D of A into D t and for all u in D, AE(S)u = E(S)Au. 

(v) The domain D of k consists of all vectors ufor~which 


and 


J ⑴ K, «) 


Aw = 


J tdE(t)u. 


⑶ 


⑷ 


There are a number of proofs known for this important result an extension of 
theorem 9 of chapter 31. The first one ever was given by von Neumann, and we 
will sketch it in section 32.2. Yet another approach will be indicated in section 32.3. 
A proof due to Marshall Stone will be outlined in chapter 34 on semigroups. The 
beautiful proof that we present here in all its details is due to Doob and Koopman. 

According to the Herglotz-Riesz theorem, theorem 6 of chapter 11, every analytic 
function /(f) in the unit disk, |f| < 1 whose real part is positive, can be expressed 
uniquely as 

m = ic + f (5) 

where m is a nonnegative measure of finite total mass on the unit circle and c is real. 
We can change the scene from the unit disk to the upper half-plane and obtain the 
following variant: 








SPECTRAL RESOLUTION 


Theorem 2. Every analytic function g(z) in the upper half-plane whose imaginary 
part is positive can be expressed uniquely in the form 


g(z) 



ds(t), 


(5，) 






SPECTRAL THEORY OF SELF-ADJOINT OPERATORS 






SPECTRAL RESOLUTION 


Given any measure n on R, real or complex, of finite total mass, formula ( 
defines a pair of analytic functions g in the upper and lower half-plane; g is calif 
the Cauchy transform of the measure n. 


isfonn (8) is one-to-one, i.e. a complex measure 
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Let u n be a sequence of vectors in the range of A—cl that converges to some vector w: 
Avii'-zvn ^ U n , 

It follows from inequality (11) that ||v n - v m || < l/|Imz|||wn - w n ||. Therefore also 
the v„ converge to some limit v. We claim that this limit u is in Z). To see this we take 
the limit of the above relation as n oo. The right side tends to «, and the second 
；i term on the left tends to — zuv Therefore-the'first term also tends to a limit, call 


r-zv-u. . (12) 



adjointness of A: 


(Au„, iu) = (v tu Aw). 

Take the limit as « oo: 

(r, w) = (v. Aw). 

By definition of self-adjointness, this shows that v belongs to the domain D of A, 
and that Av = r. Combined with (12), this shows that u belongs to the range of 
A — zj, so the range is closed. 

If the range of A — zl were not all of H, there would be a nonzero vector k in H 
orthogonal to it: 

(Au — zv, k) = (Av,k) — (i\zk) = 0 

for all v in D. By definition of self-adjointness, it follows that k belongs to D, and 
Ak = zk. But then (k, Ak) = z(Jc, k) is not real, contraiy to the symmetry of A. 

This completes the proof that A — zl maps D onto H. That it is one-to-one follows 
as above, for otherwise some k in D would be mapped into 0 by A — zl，contrary to 

^ □ 


(13) 

Corollary 1. R(z) is an analytic function of z on the resolvent set of A. 


( 11 ). 

We denote the resolvent of A by 

R(z) = (A-sI 厂 1 . 
It follows from (11) that for 2 nonreal 

||R ⑴ Bllnur 1 . 


Proof. Choose any vector u in H, and denote R(z)u as u(z). By definition of R, 
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Similarly 

(A - (z + h))v(z + h) = u. 

Subtract these two equations and divide by /z, 

, A Mz + h)-v(z) ,, ,、 

(A - 4 ) -:- = v(z-\-h) y 


which is the same as 

v(z + h)- v(z) 


R(z)v(z + h) = R(z)R(z + h)it. 


Using the estimate (13), we conclude that v depends Lipschitz continuously on 4 . 
Letting /z 0, we deduce that i>(s) is differentiable in the complex plane, so R(z) 
is holomoiphic in the strong topology. □ 


We claim that the adjoint of R(z) is R(z). To see this, choose any two vectors u 
and w in H. Denoting R(z)« = u, (A — z)v = ££, we have 

(« ， R(z)iy) = ((A - i)u, R®iy) = (u, (A - z)R(z)w) = (v, w) = (R(z)w,u?). □ 

For any u in H we define the complex valued function g(z) for z nonreal as 
follows: 


g(z) = (R(z)w ， m). (14) 

To indicate its dependence on w, we will write g(z) = g u (z) when necessary. 

Lemma 6 . For any u in H, g defined by (14) has these properties: 

(i) g is an analytic function of z in the upper half-plane Inu > 0 , and its imag- 
inaiy part there is nonnegative ： 

UO < Ifwll 2 - 

(iii) limv-^oo y g u (iy) = ll“ll 2 . 

(iv) g(z) = i(z). 

Proof. The analyticity of g follows the analytic character of R(z). Denote R(r)« 
by u; then 

« = (A ~ z)v = Au - zv. (15) 

Take the scalar product of (15) with u; since by (14) g = (u, w)> we get 


(v, Av) v). 
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Since A is self-adjoint (v, Av) = (Au, u) = (v, Av) is real, so we deduce from (16) 
that 


Img(z) = v(u, i;.), . y =Imz_ _ _ (17). 

This proves that the imaginary part of g is positive in the upper half plane, as claimed 
in (i). 

(ii) By the Schwarz inequality. . ...„ . _ - 一 d.• 

I 办⑵ I = mz)u,u)\< iiR( S )«iii| U |i< iiR(z)Hii»iF 
using inequality (13) for!|R ⑵ ||，we obtain (ii). 

—■_(iii) Take the scalar product of (15) with u: since by g„ = (v. u). we get 

||w|| 2 = (Au, uj-zg lt (z). 

Set z — /y and take the real part of this relation: 

||m|| 2 = Re (Ai? ? u) + vim g u (iy). (18) 

To complete the proof of (iii) f we have to show that the first term on the right in 
(18) tends to zero as y tends to oo. To see this, we first estimate this term by the 
Schwarz inequality: 

|Re(Au^TJII < |(Au, w )| < 

To see that i|Au|| tends to zero, we write 

Au = AR ⑵ w = (I + zR(z))u. 

Setting z = iy and using inequality (13)，we get the estimate 

l|AR(n0li<l+jilRO*y)||<2. (19) 

This shows that the operators AR(/j) are uniformly bounded. Clear 】 )' ， it! suffices to 
show that AR(iv)u tends to zero for a set of u dense in H. D is such a set, for then 
by (13) 

||AR(/.v)w|| = l|R(/v)Aw|I < ||R(/y)III|Aw|| < ||Aw||/v. 

(iv) Since R*U) = R(z), we have 

g(z) = (R(z)u,u) = (R*'(z)u.u) = (w, R(z)u) = (R(z)w, u) = g(z). □ 

Lemma 6 shows that for any u in H the function 玄 (z) defined by (14) satisfies the 
hypotheses of theorem 3. Therefore we conclude that for Imz positive g(z) can be 
represented in form (8): 
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The nonnegative measure n depends on the vector u, which we indicate as n = n tl . 
It follows from part (iv) that the representation (20) holds in the lower half-plane as 
well. 

We deduce from relations (9) and part (iii) of lemma 6 that 

.«,‘(R) = NI 2 . ⑼ 

For any pair of vectors u and v we can express (R(z)w, v) as a linear combination 
of (R(-)(« ± u), (u ± iv)) and (R (: ：)(« 士 〜)， （《 士 啡 This leads to an integral 
representation of (R(-:)m, v) as a Cauchy transform: 


(R(z)u y v) 



dn ll%v . 


( 22 ) 


The measure n u . v has a simple expression in terms of n u ± v and 士 / v ; see formula 
(44) in chapter 31, section 31.7. 

The properties of the measures n tl , v are summarized in 


Lemma 7. 

fO iht.u = 

(ii) n u , v depends linearly on u, skew-linearly on v. 

(iii) n lltV is a skew-symmetric function of u and v: n v ^ u== n ttv . 

(iv) The total variation ofn UtV is < H«|| \\v\\. 


The proof is identical to the one given for lemma 7 in chapter 31, section 31.7: ii 
is based on the simple explicit expression for n tl , v , □ 


We appeal now to theorem 1 in chapter 31. A bounded, skew symmetric, skew 
bilinear functional b(u, v) in a Hilbert space H can be represented uniquely as 

b{u, u) = (Ew, vh 

where E is. abounded syjnmetric.operator.acting on H. Lemma 7 says that for any set 
S' n(S) t ,, v is such a functional; therefore there exist bounded, symmetric operators 
E(5) such that 

_ nnJS) = msyu^ _ _ 


Setting this into”(21), we obtain 


(R(c)«, v) ■ 


-<r/(Ew, u). 


(24) 


We claim that the operators E defined by..(23.).furnish the spectral resolution for A, 
meaning that they have the properties proposed in theorem 1 at the beginning of this 
chapter. 
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Proof, (i) Since by (23) ， n{&) UyV = 0 for all w, u, so is (E(0)w, u); this makes 
E(0) = 0. On the other hand, combining (21) and (23) ， we.get that for all vectors u, 

(E(R)w. u) = n u (R) = ||w|| 2 = (m, w). -- 


It is easy to deduce from this that E(R) is the identity. 

(ii) We show first that the operators R(z) and E commute. To see this，we start 
with the fact that for arbitrary nonreal complex numbers z andix; the operators R(s) 
and R(iu) commute.-Therefore for any pair of vectors w and v in. ， - - 

(R(iy)R(s)w, u) = (R(z)R(iy)w, v). (25) 

We-use-tlie adjoint of-R ( 切 ) -te-rewrite theleft-side-of- fS •另印 
resentation (24) with R*(u;)v in place of u yields 

(R(z)w ， R* ⑽ u) = -^—d(Eu, R*(iu)u) = f —d(R ⑽ Ew ， u). (26) 

J t — z J t 一 z 

We rewrite the right side of (25) by employing the representation (24) with R(w)u 
in place of u. We get 

f 丄 rf(ER ⑽ w ， u). (260 

J t 一 z 

The Cauchy transforms (26) and (260 are-identical functions of z. Therefore, by 
lemma 4, the representing measures are identical: 

(R(iu)E(S)m ， u) = (E(S)R(w)u, v) 

for every measurable set S. Since this holds for arbitrary u and u, it follows that 


R(w)E(S) = E(S)R(w) 

for arbitrary set S and any nonreal complex number w. 

To show that E(S A T) = EOS)E(r)，we use the resolvent identity 


R ⑵ R(u;)= 


R(z) - R ⑽ 

2 — U) 


to rewrite (25); using (24) twice for both z and w yields for the right side of (25 )， 


/ (忐 - rb ) 卿， u ) = / (忐) ( rb ) 雜， ll )’ （27) 

We compare now (26) and (27); appealing once more to lemma 4, we conclude that 
the measures appearing in these two formulas are the same. Therefore for any mea¬ 
surable set 5, 


(R(iy)E(5)w, u)= 


^-d(Eu,v). 


(28) 
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where c 5 (t) is the characteristic function of the set S. We use now formula (24)，with 
win place of z and E(5)w in place of w，to rewrite the left side of (28) as 

f —^—d(EE(S)u, v). (28’） 

J t -w 

We compare (28 ; ) with the right side of (28); we appeal once more to lemma 4 to 
conclude that the measures appearing in these formulas are identical: for any mea¬ 
surable set T, 

(E(T)E(S)u, v) = j? c s (t)d(Eu, v) = (E(5n T)u, v). 

Since this holds for arbitrary vectors u and v, it follows that E(r)E(S) = E(S H T)， 
as asserted in (ii). 

(iii) The symmetry of E follows from the skew symmetry of n WiU (5). 

(iv) We have already shown that E commutes with R = R ⑵ • To show that it 
commutes with A, let v be any vector in the domain of A; then v can be written in 
the form v = Rw ，u in H. Using the identity AR = I + zR, we get 

EAv = EARw = Em + zER«. 


Similarly，since E and R commute, we can use the same identity to write 

AEu = AERw = AR(Ew) = Ew + ^REw = Em + zERu. 

Comparing the last two identities we conclude that EAu = AEu, and that E maps D 
into D. 

(v) Suppose that u belongs to the domain D of A. Then, since .R(z) maps H onto 
D, we can write n as u = R(z)w. It is convenient to choose z == i. Let T denote any 
measurable set. Using previously established properties of E and R and the resolvent 
identity, we can derive the following string of identities: 

(E(7>，v) = (E(DRm, Ru) = (R*RE(D«, u) = (R (- f)R(/)E(7>，“) 一 

=-~R(-i)mT) U ,u) 

- - - =.1. / [ 1 . 1 7ptEEt?>rrf) - 

=f — -rd(EE(r)w, m) = [ - - rd(Eli, u). 

J l-bt 2 1 + f 2 ' 

This proves that 

d(Ev, v) = v = R ⑴ u， 
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and therefore 

(] + t 2 )d(Ev, v) = d(Eu t u). 

It follows that as asserted in (3) s for v in D, 

J rd(Ei\ v) <oo. (3) 

Next we show that for u in D the Riemann integral 


rdEv 


-•(4)- 


with respect to the vector-valued El> measure converges. To see this, consider any 
finite interval S of M. and any decomposition of 5 = U5 / into disjoint subintervals, 
each of length less than 1. A Riemann sum corresponding to this decomposition is 

^r/E(5；)u = J2TjVj, tjeSj. (4，) 

where vj abbreviates E(Sj)v. It follows from property (ii) that the vectors vj are 
pairwise orthogonal, so the norm square of the Riemann sum (A 1 ) is 

Since this is a Riemann sum for the integral (3), it is bounded uniformly for all 
decompositions. The convergence of the integral (4) follows. 

To determine the value of this integral, we take any vector w in H. Let T be any 
measurable set. Using previously established properties of E and R, we can derive 
the following string of identities: 


(E(T)v t w) = (E(T)Ru. w) = (RE(7>， vj) 


This proves thaj: 


and therefore 




d(Ev. w) = - cl(Eu, v)), 

t-z 


cl(Eu y vj). 


(r 一 z)d(Ev, vj) = d(Eu. w). 
Setting u = (A — z)v on the right above gives 


!c/(Ev. uj) = d(EAi\ w). 


SPECTRAL RESOLUTION USING THE CAYLEY TRANSFORM 


389 


Integrating this relation over E，we deduce that 

J td(Ev t w) = (Au. w). 

The left side is scalar product of the integral (4) with w; since w is arbitrary, it follows 
that 


J tdEv = Au. 

□ 


(29) 

Exercise 1. Prove relation (29). 

We sketch now two other approaches to constructing a spectral resolution of a 
self-adjoint operator. 


This completes the proof of theorem 1. 

Corollary. Ifx belongs to the domain ofA n , then for every v in H, 
(A k x t v) = J t k d(Ex, v), k < m. 


32.2 SPECTRAL RESOLUTION USING THE CAYLEY TRANSFORM 

We present now von Neumann’s original approach employing the Cayley transform 
of A: 

U = (A-i)(A+ /)-'. (30) 

We urge the reader to recall from chapter 31， section 31.7. the notion of a unitary 
operator. 

Theorem 8. The operator U is unitary, that is, a norm presenting mapping of H 
onto H. 

Proof. Since.the..pperators A 士 i . map D. CALQne-iOrone_ 

itself. We claim that U is norm preserving. To see this, let u be any vector in,..//.; . . _ 

denote by v and w the vectors 

u = (A + i)^ ] u, w = U«. 


Then 


(A + Ou = u. (A — i)v = w. 
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Taking scalar products and using the symmetry of A, we get 
IMI 2 = ((A+i)v, (A+/)u) = ||Au|| 2 +|M| 2 +/[(i; ， AiO_(Au ， i;)] = l|A W || 2 +||u|| 2 , 
and similarly 

__ .|| U )|| 2 = ((A - i)v, (A - i)v) = ||Au|| 2 + Hull 2 . 

TRil^Toves IKITtnrM 而 preserving. ' ' □ 

Then von Neumann appeals to the spectral resolution of the unitary operator U in 
上以 mj 良 QtijDJiz^aiiiejlm^sujcfejmJthejinitLcinde; pulled back .to die. xeal.axis 
by it — i)/(t + /) = e ld , this furnishes the spectral resolution of A. 

32.3 A FUNCTIONAL CALCULUS FOR SELF-ADJOINT OPERATORS 

The resolvent set of A consists of all numbers z for which A — d maps D one-to-one 
onto H. The spectrum of A is the complement of the resolvent set. According to 
theorem 5 S the spectrum of A lies on the real axis. 

Exercise 1. Shows that the spectrum of an unbounded self-adjoint operator is a 
closed, unbounded set on the real axis. 

The extended spectrum of an unbounded self-adjoint operator is its spectrum com- 
pactified by adjoining oo. 

In this section we will define /(A) for every / that is continuous and real valued 
on the extended spectrum of the self-adjoint operator A. 

With start with a few observations about powers of a self-adjoint operator. For any 
natural number k, the domain of A^*. denoted as £)(A^*), consists of all vectors x for 
which x, Ax, ..., A k ^ l x lie in D, the domain of the operator A.. 

Exercise 2. Show that A k is a self-adjoint operator. 

Lemma 9. For any self-adjoint operator A acting on a Hilbert space H, 







To see this, let v denote 






A FUNCTIONAL 


belongs to D. For k = 1, this is what we want to know; for A: > 1 we write y = 
(A + /I)j ： and rearrange it as 

Aa- = )； - ix t (31) 

which shows that x belongs to D(A 2 ). For k = 2 、 this is what we want to know; for 
k >2, let A act on (31), and use (31) on the right: 

A 2 ;c = Ay - iAx = Ay — iy — x. 

This shows that a* belongs to D(A 3 ), and so on, until we place x in D(A k ). Ditto for 
We write now 

(A 2 +ir n = [(A—fir^A+fir 1 ]' ( 32 ) 

Since the k factor (A ± zl) 一 1 maps D(A* 一 h into D{A k ) t the In factors on the right 
in (32) map H into D(A n ). □ 

Exercise 3. Show that (A 2 + 1) 一 n maps H onto the domain of A 2n . 

Let q(k) be any polynomial of degree < In. It follows from lemma 9 that 
分 (A)(A 2 +1) 一 n is an everywhere defined, bounded operator. 

Exercise 4. Show that if the coefficients of q are real, q (A) (A 2 +I) _/I is symmetric. 
(Hint: First show that the domain of A 2n is dense in H.) 

The following version of the spectral mapping theorem holds: 

Lemma 10. Let q denote a polynomial with real coefficients of degree < 2n and 
abbreviate q(X){X 2 +l)'~ l as r(X). The spectrum ofr(A) consists of all real numbers 
a of form a = r(X), A. in the extended spectrum'of A. 

Proof. Write . 

r(X)-cx= {q(\) - a{k 2 + l)"] (X 2 + l) - ". 

The numerator has real and complex zeros, in conjugate pairs. Factor it as 

. k . . 

… r(X.)-a = Y{((.X.-pj) 2 + ui?j) fl ( 入 -k) ( 入 2 +l)- n . (33) 

1 ^ 1 


k 2 (/:-/:) 

"• (A) — aI =n((Aui) 2 +M5i)n (a - 入離 2 +1)-' 


Then 
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The partial product . 

.— fl(A~pji) 2 + ^ 2 ji)(A 2 +ir k 
1 

is a one-to-one map of H onio H. The-remaining factor is a one-to-one map of H 
^— . onto, iff all. the ^.belong to the resolvent set of A, and if the total number of zeros 

_of the numerator is 2n. Since it follows from (33) that / •( 入 £) = cr，the two conditions 

above for the invertibility of r(A) - cr7 can be stated so: r (X) ^ a for any real 入 in 
the spectrum of A, and r(oo) ^ a. □ 

- AeeQi^ifl^te^emma-Q-afid-exefetse^T-fbr-the-rationai-functtons r^) - considered 

above, r(A) is a bounded, symmetric operator. We appeal now to theorem 3 of chap¬ 
ter 31: ||/-(A)|| equals the spectral radius of r(A). According to lemma 10 the spec¬ 
trum of r(A) is the range of r ( 入 ）on the extended spectrum o*(A) of A. Therefore 

||/-(A)|| = > max^^ |r(X)|. (34) 

The rational functions r described above form an algebra over the reals. We claim 
that they separate points of the extended real line. Clearly, ifl\ and 入 2 have the same 
sign, ( 入 2 +1) 一 1 has different values at 入 】 and A. 2 ； ditto if or 入 2 is 00 . If 入 ] and X 2 
have opposite signs ， 入 ( 入 2 + 1)— 1 separates them. Furthermore the constant function 
belongs to the algebra. We appeal now to the 

Stone-Weierstrass Theorem. An algebra of real valued, functions on a compact 
Hausdorff space that separates points, and contains the constant functions, is dense 
in the space of all continuous functions, nonned by the maximum norm. 

For a proof see section 3 of chapter 13. 

It follows that the rational functions r(A) defined above are dense in the space 
of all continuous functions / on the extended spectrum of A compactified by the 
addition of the point 00 . That is, every such continuous function can be approximated 
uniformly by a sequence of rational functions: 

lim ;*a- = /. 

It follows that {/*/：) is a Cauchy sequence: 

max |r*(A) — 0. 

X in or(A) 

We apply now (34) and conclude that ||" 々 (A) — r^(A)|| tends to zero as k, l tend to 
00 . The norm limit of the sequence of operator /. 々 (A) is defined as /(A). 

It is easy to verify that the functional calculus we have just defined has all the 
properties listed in theorem 5 of chapter 31. This functional calculus can be used, 
just as it was done in chapter 31, section 31.3, to construct a spectral resolution of 








the operator A. A spectral representation can be built along the lines of chapter 31 ， 
section 31.5. 


NOTES. The first unbounded operators for which a spectral theory has been de¬ 
veloped were differential operators, ordinary and partial. These will be discussed ， 
lightly, in chapter 33. The first general spectral theory for unbounded integral opera¬ 
tors was developed by Carleman, in the context of singular integral operators. 

In the bibliography we list the early contributions to the spectral theory of self- 
adjoint operators. 
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EXAMPLES OF SELF-ADJOINT 
OPERATORS 


The definition of self-adjointness demands that the domain of such an operator be 
specified with the greatest precision. This is possible in some cases, but for most par- 
tial differential operators with variable coefficients defined on domains and subject 
to various boundary conditions, it is not possible—and not useful—to give an ex- 
act description of their domain. Instead, such operators are defined by some suitable 
process of extension. In the first part of this chapter we describe such processes. 


33.1 THE EXTENSION OF UNBOUNDED SYMMETRIC OPERATORS 

Definition. An operator C is called an extension of operator B if the domain of C 
contains the domain of B and Cu = Bm on their common domain. 

We are given a linear operator B mapping a dense subspace D(B) of a Hilbert 
space H into H that is symmetric: 

(Bu, v) = (m,Bu) (1 ) 

for all u and v in Z)(B). We pose the following questions: 

(i) Is it possible to extend B to a self-adjoint operator? 

(ii) In how many ways? 

(iii) By what process? 


We recall the notion of a closed operator as one whose graph in if x H is closed. 
We spell it out: 

Definition. An operator C mapping a dense subspace Z)(C) of H into H is closed 
if for every sequence {«，] in D(C) that converges to a limit u in N and for which 
[Cu n ] converges to a limit winH,u belongs to the domain of C and Cu = w. 




— 





兹縦獅纖獵纖二 V ‘二:二 ' : 〆 '“: V 魏 r 
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)Let z be any complex number, u any vector in the domain of B, and .denote by ^ 
vector 

(B-z)« = /. 

the scalar product with u: 


(Bw, u) - z(u, u) = (/,«)• 
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-rm on the left is real; smce the imaginary parts of 


|lnu| = .. ... 

It follows from this inequality that 

• . : . . ll^ll 5 —~~'-II/II ； - -—— (2-) - 

|im z\ 

this implies that the operator C — z is one-to-one. 

be a g 却监 mie 一 Qf—vectors in_tbe _can ge_Qf 五 —7 that cnnvp.rgps to fmH: 

(B-z) u n = /„. (2') 

It follows froin(2) that fw„} converges to some limit u in H. Bu 丄 then，by (2’)，so does 
Bw". Since B is a closed operator, w belonss to the domain of B and R/v = f 4 - yu 





EXTENSION OF SYMMETRIC OPERATORS; DEFICIENCY INDICES 397 

To see why this is so, we abbreviate 

(A-z) _l f = x, (A -z) _l g = y\ (4) 

then we can rewrite (3) as 

(x, (A - z)y) = ((A - z) x, y). (4 ’） 

Since A is symmetric, this is valid for all x and y in the domain of A. Since A — z. 
and A - z map D(A) onto H, it follows that (3) holds for all / and g in H. 

We are now ready to prove that A is self-adjoint. What we have to show is that if 
v belongs to the domain of A*, then v belongs to the domain of A and A !f! v = Au. 
Now v belongs to the domain of A*, with A*u = if for all x in D(A), 

(Ax, v) = (x t w), (5) 

Subtracting v) from both sides yields 

((A-z)x, v) = (x, w-zv). 

Using the abbreviation (4) and relation (3), with g == w ~zv,we can rewrite this as 
(f ， v) = ((A — 2 )—i/’io-b) = (/, (A-zr'iw -zv)j . (5’) 

Since (5) holds for all x in D(A), (5) ; holds for all f in H; it follows therefore that 
v = (A-z)^ 1 (w-zv). 

Since the range of (A - - 1 is D(A), it follows that v belongs to D(A): acting 
on both sides by A — z shows that Av = w. Since w was defined above as 
A*i; = Av. □ 

NOTE. In proving the converse, we used only the assumption that some nonreal z 
and z belong to the resolvent set of A. 


33.2 EXAMPLES OF THE EXTENSION OF SYMMETRIC OPERATORS; 
DEFICIENCY INDICES 

~We turn now to some examples that iiriistraSThe notioirdFcTfc^iiFe of a symmetric 
^^atOfraiid th 色 jjJ6 茲 ib—ilfties Of self-adjoint! extensions. 

Example 1. 

Definition. Denote by H the Hilbert space L 2 (E), and define the operator B as 
/ (d/dx) acting on the domain D(B) = Cq consisting of all once differentiable func¬ 
tions on R with compact support. 
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Proposition. B is symmetric, and its closure B is self-adjoint. 

Proof. Integration by"parts shows that B is symmetric. Let z be any compleiTnum- 
ber. The range of B —4 •consists-of~all"functions / of form - - 

.d , 

i jz 11 -zu- f, u € C 5 . ⑹ 


~M!ultiply by * -- - - .•- — —• - - --……- 

(e izx u) = e izx f. (6') 

Integrate over K; since u has compact support, we get 

0 = / e lzx fdx, ⑺ 

J 一 oo 

a condition satisfied by every function / in the range of B — 2 . Conversely，every C 0 
function / on E*that satisfies (7) belongs to the range of B — 2 . To see this, define 11 
by 


u(x) = -i r e iz ^'- x) f(y) dy. ( 8 ) 

~ J-oo 

Clearly, u has continuous derivative, and if / is zero outside a compact interval 5, it 
follows from (7) and (8) that m too is zero outside S. 

The function e lzx is not square integrable on R; therefore the set of continuous 
functions / of compact support that satisfy condition (7) is a dense subset of L 2 (jR). 
According to part (iii) of theorem 1, for z nonreal the range of B — z is closed; so 
the range is all of H. Since B — z is one-to-one, z belongs to the resolvent set of B. 
According to theorem 2, it follows that B is self-adjoint. „.— 

A symmetric operator whose closure is self-adjoint is called essentially self- 
adjoint. □ 



Definition. Denote by H the Hilbert space L 2 (R+)，and define the operator B as 
i{d/dx) acting on the domain D(B) = Cq consisting of all once differentiable func¬ 
tions whose support is a compact subset of ( 0 , oo). 

Proposition. B is symmetric, but its closure B is not self-adjoint. Furthermore B 
has no self-adjoint extension whatsoever. 


Proof. Symmetry of B follows by integration by parts, since all functions in the 
domain of B are zero near 0 and oo. Arguing as in example 1, we conclude that a 
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continuous function of compact support on E + belongs to the range of B — z iff 
/ *00 

0= e izx fdx. (9) 

Jo 

For Imz < 0, the function e tzx is not square integrable on 1R+; therefore the set of 
Co functions / satisfying (9) is dense in L 2 (E + ) = H. It follows from theorem 2 
that the range of B — zis all of H. 

It is otherwise when Imz > 0, for then the function e izx is square integrable 
on R+，and therefore the range of B - z 一 consists of all f in H that satisfy the 
orthogonality condition (9). It follows that B is not self-adjoint. 

To see that B has no self-adjoint extension, we note that according to corollary 1, 
such an extension A would have to be an extension of B as well. Let v be a function 
in the domain of A that does not belong to the domain of B. Choose any z with 
Im z < 0; since B — z maps D(B) onto H, there is function u in D(B) such that 

(5 - z)w = (A-z)u. 

Since A is an extension of B, 

(A -z)(v- u) = 0. 

This is impossible unless v—u = 0, for A is symmetric, and so, according to theorem 
l,A — z has no nonzero null vector for complex z. □ 

Example 3. 

Definition. Take H = L 2 (0,1)B = i(d/dx) acting on functions u in £>(B) = Cq 
consisting of continuously differentiable functions on [0,1] that vanish at a* = 0 
and ,1. . 

Proposition. B is symmetric, but its closure B is not self-adjoint. However, B has 
self-adjoint extensions. , 


Proof. Symmetry of B follows by integration by parts. Arguing as before, we see 
that the range of B — z, Imz # 0, consists of all Lr functions / that satisfy the 
orthogonality condition 



According to theorem 2, B is not self-adjoint. 

We construct now some self-adjoint extensions of B. 
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-2. 


Cayley transform of U ， 

A = i(i+u)(i-ur 1 , (ioo 

defined on the range of I — U, is a self-adjoint extension of C. First we have to show 
that U has a Cayley inverse, namely that I - U has no nullvector. To see that I 一 U 
annihilates only the zero vector, suppose that (I — U)n = 0. By adjointness, for any 
vector y, 

0 = ((I-U);7 ， y) = (MI-U*) ： y) ; 

it follows that n is orthogonal to the range of I — U*. Since U is unitary, this equals 
the range of (I — U*)U = U — U*U = U — I. 

From formula (10) we get that V — I = 一 2/(C + /I)— 1 , According to theorem 
1， the range of (C + fl)— 1 is the domain of C, a dense subspace of H, Since U is 
an extension of V, the range of I — U contains the range of I — V, and so it too is 
dense in H. This proves n — 0, and so I — U is invertible. Since according to ( IQ’} 
the domain of A is the range of I — A, it follows that A is densely defined. 

Next we show that A is symmetric. Let u and u be a pair of vectors in the domain 
of A. By definition (10 ; ) of A, 

(AH.uj-z^a+^a-ur'u.u). 

By adjointness, this equals 

/(MI-UT’d + U、). 

Since U is unitary, U* = U _1 ; so the above can be rewritten as 

/ («, a - u - 丨 )- 1 (i + = ( (w,(u- ir'ud+u- 

=/(«,(U- I)~'(U + I)u) = (« ， Au). 

In the last step we used the fact that in formula (10’）defining A, the two factors 
commute. 

To show that A is not only symmetric but self-adjoint we use theorem 2 and ver¬ 
ify. that every nonreal complex number z belongs to the resolvent set of A. Using 
definition (100 of A, we write 



•、 — 〆 ' 


A-;I = j(H-U + iz(I-U ： 
We saw in chapter 31, section 3 
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REMARK. The deficiency indices are nonnegative integers or oo. We assume that H 
is separable, so there is on^H^tie-ldnd of co. - 

Exercise 3. Prove part (i) of theorem 3. 

Exercise 4. What are the deficiency indices of Examples L 2, and 3? 

… Theorem 3 has this important - --- 


Corollary. Let K be a Hilbert space over the reals, and B a densely defined sym- 
metric operator on K. Such a B has a self-adjoint extension to tlie complexiHcafion 



Proof. The complexification of K is H = K iK; there is a natural extension 
of B to if. There is a natural complex conjugation in w 4 - iv = u — iv 9 u and v 
in K. Conjugation commutes with the action of B. 

Denote by C the closure of B; C too commutes with complex conjugation. It 
follows that the range of C - zl is the complex conjugate of the range of C — n. It 
follows that the codimension of the range of C - zl equals the codimensions of the 
range of C — n. This proves that the deficiency indices of C are equal; so according 
to theorem 3, C has a self-adjoint extension. □ 


33.3 THE FRIEDRICHS EXTENSION 

In this section we describe an enormously useful method, due to Friedrichs, of con¬ 
structing a self-adjoint extension of a large class of symmetric operator, such as 
Schroedinger operators. The way this extension is carried out imposes certain bound¬ 
ary conditions automatically. 

Definition. A symmetric operator L defined on a dense subspace D of a Hilbert 
space H is semibounded (from below) if 

c||«|| 2 < (U,hu) (11) 


for some constant c and all u in D. 

In what follows we will take the constant c to be 1; this can be accomplished by 
augmenting L by a sufficiently large multiple of the identity. We define on D a new 
scalar product, denoted by (u, u) L , as follows: 

(u,w)l = (^Li/). (12) 


The symmetry of the operator L guarantees that (v, w)l is Skew symmetric, and 
semiboundedness, with c = 1, shows that (w ， w) L is positive. We define the L-norm 








II«IIl = «)l 2 - 

[t follows from (11) with c = 1 that the L-norm of every u in D is bigger than 
original norm: 

ll«|l < IMIl. (1 

The subspace D is, in general, not complete in the L-norm; we can complete 
Denote its completion by it consists of equivalence classes of Cauchy sequent 
in the L-norm. By (11’)，a Cauchy sequence in the L-norm is also a Cauchy sequer 
in the norm of H. Since H is complete, such a Cauchy sequence has a limit in 
: his defines a natural manning of Hr into H. 


























Exercise 5. Show that the inverse of a symmetric operator is symmetric. 


We give now some examples of semibounded operator and their Friedrichs exten- 


Example4. H = L 2 (0,1)，L = -(cl 2 /dx 2 ) + cj, q some continuous function on 
[0,1]. The domain of L is Cq( 0, 1). Since every u in D(L) is zero at the endpoints, 
integration by parts gives 


=( W ,L«) = f (.u 


Clearly, inequality (11) is 


with c = min^. Let us assume that c 


Proposition. Every function in Hl Is continuous on the closed interval [0, 1] and 
vanishes at the endpoint. 


Proof, For every u in Cq, we deduce using the Schwarz inequality that for every 
a, b in [ 0 , 1 ] that 


\u(b) - u(a)\ = / u x dx\ < Jb — a ( / u 2 x dx } < Vb^a\\u\\ h . (18) 


It follows that a Cauchy sequence in the L-norm converges uniformly, and that the 
limit u in satisfies (18) and is zero at the endpoints. □ 


zero Dirichlet boundary conditions on ^functions in the dom; 


Exercise 6. Show that the closure of the operator L in exan 


Exercise 7. H = L 2 (0 f 1), L = — {cl/dx)p(d/dx) + q, wl 
-tion-irreVfHn-GVlhe-domain-of-t-is-G^QrBT*-Show-that € 




-tioiritr 6 ^/-in- 67 the-domain-of^lT-i^G L 2 -( 0 rl-)--Show-that every «-in the domain-of 
/ 一 is ， eontinucms 、 in，[Orl ]-and is zero at the endpoints. 


Example 5. G a bounded domain in the x, y plane, H = L 2 (G), the space of square 
integrable functions in G, L = -△ = 一 (3^ + 3?), the domain of L is the space 
Cg(G). 


Proposition. When G has a smooth boundary, evefy function in the domain of h F 





Exercise 8. Prove this proposition. (Hint: Show that ||w||^ = up dx dy.) 

■Exercise 9. • Showthat the operator L*in example 5'has deficiency indices 06 , 557 一 

33.4 THE RELLICH PERTURBATION THEOREM . .. . 

In this section we present a result of Rellich that says，roughly speaking, that if we 
add to a self-adjoint operator A a symmetric operator T that is not too large compared 
to A then the sum A + T is self-adjoint. Here is the precise result: 











is w in Z)(A) that is mapped into v: 

(A + ic)w = v. 

Setting this into (20) and choosing u = w gives 

(Aiy + icw t Aw + ics) + (Tw y Aw + ics) = 0. 

Estimating the second term by the Schwarz inequality yields after an algebraic ma¬ 
nipulation 

||Au; + fciy|| 2 < ||Tiu|| 2 . 

Since A is symmetric, the left side of equal ||Aiy|| 2 + c 2 ||w;|| 2 ; the right side is 
bounded by (19): 


l|Aw|| 2 + c 2 \\w\\ 2 < a 2 \\wf + b 2 \\Aw\\ 2 . 


















1 enough, the coefficient of ||Bw||- in (24) can be made < 1; this sho 

一 一 )‘ We appeal now to corollary 5 to 

□ 
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l|T«l| 2 <2ell«ll 2 + (_ + 2)( 去 il»lP +|||B H || 2 ) . ■(28) 

Since € is arbitrary, we can take it so small that the CoefBd6ftr6r||BiT|f 2 一 i!T(287B- 
less_than_l. So an inequality of form (19) is satisfied for all u in D(B). By corollary 
5, 5 + T is self-adjoint. □ 

Exercise 12. Carry out the details of the proof that B is essentially self-adjoint. 

33.5 THE MOMENT PROBLEM 


In chapter 14, section 14.7, we have formulated the 

Hamburger Moment Problem. What sequences aQ,ai,... of real numbers can 
be represented as moments of a mass distribution on E: 

a n = / t n dm, (29) 

m 

where m is a nonnegative measure whose support is larger than a finite set of points. 

As observed in chapter 14, every sequence ao, a.},... of form (29) is Hankel pos¬ 
itive, which means that the quadratic form 

Q = (30) 

is positive for every nonzero choice of real numbers 专 i ， . • • ， f"，// arbitrary. This is 
evident from the formula 

Sanuhh = f Xt n+I %!kdm; j (Jit'%) 1 dm. — (30，）. 

Clearly, the right side of (300 is nonnegative. Furthermore, when the support of the 
measure in is not finite, it is not confined to the zeros of the polynomial this 

shows that the right side of (30’）is positive. Note that it follows that ao is positive. 

Hans Hamburger has shown that this necessary condition for the a n to be repre¬ 
sented in form (29) is also sufficient: 

Theorem 6 (Hamburger). Let [a n ] be a sequence of real numbers such that the 
quadratic form (30) is positive. The the a n are the moments of a nonnegative measure 
on the real axis M; that Is，they can be represented in the form (29). 


Proof. Denote by D the linear space of all finite sequences of real numbers: 

x = ( 夸 o ， $i ， … ，与 n ， Q ， …). 
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Define the scalar product (x, y) on D by the quadratic form Q in (30): 

U, y) = y) = Ia n+ /^ n Vk. (31) 

Denote the completion of D with respect to the norm \\x\\ = Q ] ^ 2 (x,x) by K, and 
denote by /? = if + / 尺 the complexification of K. 

Define on D the operator R as right shift, that is, 

R^ = (0,^o^l >■•.). (32) 

Denote by e the unit vector in D: 

e = (1,0, 0,…). 

Then 

R% = (0, …， 0，1，0...)， 

and it follows from the definition (31) of the scalar product that 

{e^ l e)=a n . (33) 

R is symmetric, for we can write, denoting n — 1 as €, 

(Rx, y) = 'Lan^Hn-X^k — E 御 +AH-1& 耿 . (34) 

The sum on the right is, clearly, a symmetric function of x and 
Since R is a symmetric operator acting on D, a dense subspace of the real Hilbert 
space K, it has, according to the corollary of theorem 3, a self-adjoint extension to 
the complexification of H of K. Denote by E the projection-valued measure that 
gives the spectral resolution of such an extension of R. According to theorem 1 of 
chapter 32 ， 

_ _ {e,R' l e )±： J t' l d(Ee,e)\ 

combining this with (33)，we get the sought-after representation (29)，with m = 
(Ee ， e). □ 


NOTE.^ItJopk Hamburger 150 pages to prove his theorem. Using the theory of self- 
adjoint operators, it takes less than a page. 

A related problem is the Stieljes moment problem used to represent a sequence of 
real numbers as the moments of a mass distribution on the positive real axis IR+: 

a n = [ t n dm. (35) 


4 
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The positivity of the quadratic form Q defined in (30) is certainly a condition neces- 
sary for a representation of form (35). So is the positivity of 

Sa n+Ii +\H (36) 

This is eyidentirom.the formula— 

一 ― 一一岛二 d&.=7 = 〆 t{Y,t n ^ n ) 1 dm. 

： : " jw^t ― jr + . . . . 

The positivity of (36) can be expressed by setting )，= jc in (34) and writing 

> 0 . . 
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Theorem 8. 


_(? ) Th.p. moment problem (29) ha s a unique solution iff the operator R is essen- 

tially self-adjoint, that is, has a unique self-adjoint extension. 

(if) The moment problem (34) has a unique solution iff the operator^ R Q 
unique nonnegative self-adjoint extension. 

Neither part of the theorem is obvious.HBven-if-R-has - two -distinct self-adjoint- 
extensions Ri and R 2 , it is not clear that the measures (Eje, e) and (E 2 ^, e) are 
distinct. On the other hand, when R is not essentially self-adjoint, there are solutions 
m of th^moment problem that are not of theform (Ee, e), where E is the spectral - 
resblMi 而 奶 s _ df-^dj _ oiiireXtetision of k. 

Fora proof of theorem 8, and for a review of the literature of the moment problem, 
we refer to the article by Barry Simon; see also Henry Landau’s article in the AMS 
Symposium volume edited by him, as well as the books by Akhiezer, and Shohat and 
Taniaxkin. 

HISTORICAL NOTES. Stieltjes introduced the integral named after him in connection 
with his work on the moment problem. 

The theory of self-adjoint operator was created by von Neumann to fashion a 
framework for quantum mechanics. The operators in Schrodinger 5 s theory that are 
associated with atoms are partial differential operators whose coefficients are sin¬ 
gular at certain points; these singularities corresEpnd to the unbounded growth of 
the force between two electrons that approach each other. To define such differen¬ 
tial operators as self-adjoint ones is not a trivial task. Examples 5 and 6 presented 
in section 33.4 allow some singularities in the potential q, but the ones occurring 
in quantum mechanics are more singular still. I recall in the summer of 1951 the 
excitement and elation of von Neumann when he learned that Kato has proved the 
self-adjointness of the Schrodinger operator associated with the helium atom. 

And what do the physicists think of these matters? In the 1960s Friedrichs met 
Heisenberg, and used. the occasion to express to hrn the deep gr 里 itu_4? 中? ?9扭二 
munity of mathematicians for having created quantum mechanics, which gave birth 
to the beautiful theory of operators in Hilbert space. Heisenberg allowed that this was 
so; Friedrichs then added that the mathematicians have, in some measure, returned 
the favor. Heisenberg looked noncommittal, so Friedrichs pointed out that it was 
a mathematician, von Neumann, who clarified the difference between a self-adjoint 
operator and one that is merely symmetric. “What’s the difference,” said Heisenberg. 
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SEMIGROUPS OF OPERATORS 


The natural source of semigroups of operators are partial differential equations de¬ 
scribing evolution in time, and flows generated by dynamical systems. In this chap¬ 
ter we present an abstraction of these concrete situations, a point of view initiated by 
Hille. In the next two chapters we will present illustrations and applications of the 
theory. For a detailed treatment of this subject, we recommend Hille-Phillips, Yosida, 
and Goldstein’s excellent monograph. 

Definition. A one-parameter semigroup of operators over a complex Banach space 
X is a family of bounded linear operators Z(f), f > 0, each mapping X X ， with 
the following properties: 

Z(t + 5 ) = Z(t)Z(s) for all t,s >0; Z(0) = I. (1) 


Equation (1) is the multiplicative property of exponential functions. We show next 
that under an additional continuity property, equation (1) characterizes exponential 


functions. 


Theorem 1. 


(i) Let G: X X be a bounded linear map. Define Z(t) to be 
Z(t) = e' G , t>0, 

where the exponential of the operator is defined by the power series 


^ G = E 


t n G n 


⑵ 


⑶ 


Then Z(r) is a one-parameter semigroup of operators, continuous in the norm 
topology for operators. 









[ <a,oy rormuia p;. me multiplicative property (l) implies that Z ⑴ and 
Z(s) commute; therefore we deduce from (1) and (5) that 

L(f +s) = L(0 + L(^), r + ^ < a. 


From this we deduce that for all rational t < a, 厂 1 L(r) is independent off; denote 
this operator by G: 








34.1 STRONGLY CONTINUOUS ONE-PARAMETER SEMIGROUPS 

The most interesting semigroups are not of the form (2) but are associated witiraiP 
ferential equations, of which a typical example is the heat equation r -- 

u t — d^u = 0 , 

u. being a periodic function of ;c, A solution of the heat equationis..uniquely deter^,,,^.. 
mined by. specifying its initial values, namely.its value at /. = 0. The initial value 一 
can be specified as an arbitrary continuous function; the absolute value of the corre¬ 
sponding solution at any later time does not exceed the maximum absolute value of 
the prescribed initial values. Denote by Z(r) the operator relating the initial values 
«'(x；'0) "of soltitrons to u(x， 丁) r€tearlyrtfa:eseroperat o r s ~ form a.ofl e^rametQi^mfezzzr 
group in the sense of (1). Yet they are not of the form (2); for if they were, they could 
be extended to negative values of t to forma group of operators. But it is well known 

that such extension of solutions of the heat equation backward in time is not possible 

in general. 

The heat equation semigroup Z(r) is not uniformly continuous at ? = 0; yet it 
retains a less stringent kind of continuity that expresses the fact that each solution u 
is a continuous function of t: 

Definition. A one-parameter semigroup Z(r) is strongly continuous at r = 0 if 

s 二 —(7) 

for all x in X. 

Theorem 2. Denote by Z(/) a one-parameter semigroup of operators that is strongly 
continuous at t = 0. 

(i) There exist constants b and k such that Z(r) is bounded in nonn by 

|Z(r)| <be kt . ⑻ 

(ii) Z(t)x is a strongly continuous function oft for eveiy x in X. 

Proof, (i) We claim that |Z(r)| is uniformiy bounded in some neighborhood of 
f = 0. To see this, suppose, to the contrary, that there is a sequence^* 0 such that 
\Z(tj)\ oo. By the principle of uniform boundedness (see chapter 14, theorem 7) 
Z(tj)x could not converge to x for all x in X. This violates strong continuity at r = 0; 
therefore there exists an a > 0, > 0 such that |Z(r)| < b for t < a. 

Any t can be decomposed as / = -f r, 0 < r < a. By the semigroup property, 

Z(t) = Z n (a) Z(r). So 

|Z(0l<|Z(fl)nZ(r)|<Z7 ,J+1 <be k \ 

where k = ^ \ogb. This proves (8). 
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(ii) For any pair of positive numbers s < t wq can，by the semigroup property, 
write 

Z(t)x — Z(s)x = Z(^)[Z(? — s)x —x]. 

Combining (7) and (8) strong continuity follows. □ 

Next we show, speaking loosely, that a strongly continuous semigroup can be 
interpreted as an exponential function of an unbounded, not everywhere defined, 
operator. We start with a few facts and notions about unbounded operators: 

Definition. Let D be a dense linear subspace of a Banach space X， G a linear opera¬ 
tor mapping D into X. We call the operator G closed if whenever {^j is a sequence 
of vectors such that x n x and Gx n y, then x lies in D, and G.r = y. D is 
called the domain of G, and is denoted as D(G). 

According to the closed graph theorem, a closed linear operator defined at every 
point of a Banach space X is bounded. The operators we encounter in this chapter are 
defined only on a dense subspace of X and are unbounded. 

Definition. Let G be a closed operator with domain D(G); a complex number f 
belongs to the resolvent set of G, denoted as p(G)，if fl —G maps _D(G) one-to-one 
onto X. The spectrum of G, denoted as cr(G), is the complement of its resolvent set. 

Suppose that f is in the resolvent set of G; then (I 一 G is invertible. Its inverse 
is called the resolvent of G and is denoted as 

R(0 = (fl-G)-*; 

it maps X onto D(G). Since G is closed, so is R(f )； it follows then from the closed 
graph theorem, theorem 12 of chapter 15, that the resolvent R(^) is a bounded oper¬ 
ator. 

Exercise 2. Suppose that the resolvent set of Gis not empty, and that f belongs to 
p(G). Show that a complex number y belongs to the spectrum of G iff (f — y)^ 1 
belongs to the spectrum of R(f )• We can express this symbolically as 

One can think of (9) as an instance of the spectral mapping theorem for unbounded 
operators. 二 

Exercise 3. Deduce from (9) that the spectrum of G is a closed set in the complex 
plane. 


Next we define the transpose of an unbounded operator: 
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Definition. Let G be a densely defined, closed linear operator in a Banach space X. 

爾 -G^'theTelmran . . . . 


..(Ga-, i) = (x, G^). (10) 

The meaning of (10) is this: The domain of G 7 consists of those linear functionals l 
for which the leftTide'ofX~TO): is a 、 饱 

Since Z)(G) irdense \vrX% this-botmtietHinearfimctional-cairbe-extendecHmiqtiely - 

from D(G) to all of X\ this extension is denoted by G^, its domain by D(G l ). 

Exercise 4. Show that the transpose of a densely definediinear operator is closed._ 


The difficulty with the preceding definition is that the domain of G’ is an elusive 
thing. The following result is useful for pinning down the domain of G / . 

Theorem 3. Let X be a reflexive Banach space, G a densely defined, closed linear 
operator mapping D(G) into X, whose resolvent set is not empty. Then its transpose 
G / is a densely defined, closed linear operator mapping D(G f ) into X\ and p(G’)= 
P(G). 

Proof. Let < be a complex number in the resolvent set of G; subtract f (x, i) from 
both sides of (10): 

((G - = (G 7 - fl)£). (10') 

Denote the resolvent of G by R(f) = 一 （ G— (I) 一 1 and define R’(f) as the transpose 
of R((): 

= ( 10 ") 

We claim that ) is the resolvent of G’. We show first that R ; (f) is one-to-one; 
for otherwise there would be a nonzero m such that R’(（）m = 0. Setting this into 
(10")，we conclude that the range of R(f) is annihilated by m\ since the range of 
R(f) is D(G), assumed to be dense, this is a contradiction. Now define R’ 丨 (f) to 
be — G’ + (I' and set R(f )y = x and )m = t into (10"). Thus we obtain (10'). 

The domain of G / as defined above is the range of R’(f). It remains to be shown 
that G’ cannot be extended any further. To see this, we note that the range of G’ 一 fl’ 
as defined above is all of X f . Therefore, if we extend G’ further, G’ 一 (I’ would an¬ 
nihilate some nonzero Setting this into (100, we would conclude that ^ annihilates 
the range of G — f I, a contradiction since f belongs to the resolvent of G. 

The last task is to show that the domain of G'，identified above as the range of 
R/((), is dense in X. If it were not, then there would be a v in X that annihilates 
the range of R ; (f) (it is at this point that we use the reflexivity of X). Setting this 
into (30")，we conclude that R (())， = 0, contrary to the fact that f belongs to the 
resolvent set of G. □ 
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Exercise 5. Show that in the Hilbert space setting the conclusion of theorem 3 has 
to be modified as follows: ^(G*)= 泛 (G). 

Definition. Let Z(t) be a strongly continuous one-parameter semigroup of opera¬ 
tors: X — X ， meaning that (1), (2), and (9) are satisfied. Its infinitesimal generator 
G is defined by 


G.r = ^ - 


lim 

/i -^0 


Z(h)x - x 


( 11 ) 


the domain of G, denoted as D(G)，consists of all x for which the strong limit (11) 
exists. 


Theorem 4. Let Z(t) be a strongly continuous one-parcimeter semigroup, G its in¬ 
finitesimal generator. 

(i) G commutes with Z(t) f in the sense that if x belongs to D(G), so does Z(t)x, 
and 


GZ(t)x = Z(r)Gx. 


( 12 ) 


iii) The domain of G is dense. 

f iii) The domain of G n , n any natural number’ is dense, 
f iv) G is a closed operator. 

“V All complex numbers f whose real part is > k belong to the resolvent set of 
G, where k is the constant appearing in inequality (8). The resolvent of G is 
the Laplace transform of Z. 


Proof, (i) Using (1), we can factor the difference quotient in two ways: 


Tj{t + h) — Z(f) 


x = Z(t) 


Z(A) 




(13) 


-vr'hen-r-belongs to D(G), the middle term converges as /z 0 to Z(t)Gx. Therefore 
: he terms on the right and left converge also, and we deduce from (13) that 


(14) 


^?5revEyTin D(G )； This proves part (i). 

: ii) We claim that an integrated form of (14), 


Z(t)x 一 x = G Z(s)x els. 


( 15 ) 
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Riemann sums. To prove (15), we want to evaluate the action of G on this integral. 
_ _ Letting. .Z(/0- ac t on the integrand, and using the semigroup property, we get 

乂 Z(s)x ds = \ j Q [Z(^ + h)x - Z(s)x] dx 

. . . - - i - y»/4-/j—. — 1 ph 

= 7 / Z(s)x ds — T Z(s)xds. 

" 一 二二 ^ 二 二二 ； … h Jq - 

Since Z(s)x is strongly continuous, the terms on the right converge to the left side 
of (15); this proves that for any a* in X, Jq belongs to jD(G), and that (15) 

.. … ...holds. . 

- ^~it"foltewsifronrtte-strong~eonthmity*of-Z-that-forany^c-itt-^T - - 

. 1 • 
lim - / X(s)x ds ■= x\ 

^0 t J Q 

this proves that D(G) is dense in X. 

(iii) We argue similarly about the domain of higher powers of G. Denote by 沴 any 
infinitely differentiable function on E supported on [0,1]. For any x in X we define 

== j (p(s) Z(s)xds. 

The same argument as above shows thatx^ belongs to the domain of G, and that 
Gx^ = - J" ip\s)Z{s)xds. 

Clearly, belongs to the domain of all G ；I . We choose now a sequence of (pj that 
are nonnegative, satisfy / (p jds = 1, and whose support tends to zero. Appealing 











continuous semigroup, and that its infinitesimal generator is G—fl , 、 
the generator of the original semigroup. We apply (15) to the raodifi 


•e 二 P 一 - f e^ s -2,{s)x ds. 


Suppose that Re f > k; as / -» oo, the left side tends to -x. The integral on the 
right tends to L(f).r; since G is a closed operator, we conclude that 

This shows that L(f) is a right inverse of (f I — G). To. deduce from this that L(f) 
is the inverse of f — G，we use (150 in place of (15). [ 


As we will see in chapters 35 and 36, there are plenty of semigroups of operators 
that are strongly, but not uniformly, continuous; however replacing strong with weak 
continuity doesn’t add much. 










Theorem 5. Let X denote a Banach space^ X(t); X ^ X. a one-parameter^ 
of bounded linear operators that is weakly continuous at t ^ 0: 

for all x in X and i in X\ Then Z(/) is strongly continuous. - - - 

一 ^ roo f 亍 his is a som eGii^r ^^Hs&g resultTT^ Semands mucii 

less than strong continuity. The proof, which we omit here, is rather tricky; see Hille-~ 
Phillips or Goldstein. □ 
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by its infinitesimal generator. 

Proof. Suppose that Z(/) and W(r) have the same generator G. Take x in D(G); 
using the commutation rules (14) and (12), we get 

-^-W(r)Z(j - t)x = W(r)GZ(j ： - t)x - W(r)GZ(^ - t)x = 0 (18) 

at 

for all x in D(G). Then, by the fundamental theorem of calculus, we conclude that 
W(^)a* = Z(s)x. Since D(G) is dense, this holds for all a* in X. □ 

We show now how to reconstruct Z(t) from G in the case where Z(r) is a con¬ 
traction, that is, 

|Z(r)| < 1 forallr >0. (19) 

Theorem 7. 

(i) The infinitesimal generator G of a strongly continuous semigroup of contrac¬ 
tions has eveiy positive, real X in its resolvent set, and 

iRw^iai-Gr^T- (20) 

入 

(ii) Conversely, even densely defined unbounded operator G whose resolvent set 
includes the positive reals, and whose resolvent is bounded by (20)，is the 
generator of a strongly continuous semigroup of contractions. 

This remarkable result is called the Hille-Yosida theorem, after its discoverers. 


Proof. Part (i) is just a restatement of inequality (17), with b = \,k = 0. 






We give Yosida’s proof of part (ii). It is based on approximating G by G„ - 
/iGR(/z) and letting n oo. The identity 


Gn = « 2 R(/z) - nl (21) 

shows that G" is a bounded operator. We approximate Z(f) by Z n (t) = e tGn where 
the exponential is defined as an infinite series. We show first: if (20) holds, then for 
all a: in X ， 

lim nR(/ 2 )x = x. f??) 

/i—>oo / 

We use the identity 

fiR(n) 一 I = R ⑻ G 

and inequality (20) to deduce that for x in D(G), 

|/jR(«).v-.-c| = \R(n)Gx\ < -|G.t|. 

n 

This proves (22) for 工 in D(G). Since, by (20), the operator nR ⑻ has norm < 1 for 
all /i, and since D(G) is dense in X, it follows that (22) holds for all x in X. 

Next we show that for all x in D(G), 

lim G n x = Ga' (23) 

By definition of G IU for all .v in D(G), = "GR ⑻ x = «3^/2)Gx，so (23) 

follows from (22). 

By definition of Z„, we obtain, using formula (21) for G /z , that 
Z n {t) = e ,G « = e - ni e n2R(n) t = e- n, 

o 

Using (20), we deduce that each Z n (t) i§ a contraction: 

w ^r (iy= e ~ n，e，，t = 1 - (24) 

_To_estimate the difference of 7, n nnd 7,j 

: with Z /l? and[ h . ___ 

—Z,i(j — = 2 

at 

Using (24), we get that the norm of the r 
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imbininff (23) and (25\ we deduce that for all x in D(G), the limil 


exists, uniformly on bounded sets ot it roilows men xrom tne umiorm oounacu- 
ness of \Z n \ that (26) exists for all x in Z. It follows directly that since Z n (s) is a 
semigroup, so is Z(s) .-Since-the convergenee-in-(;26)-is imiform-forK“S ， :strong^^^ 

"continuity of Z/i implies strong continuity-of l Zn-^ inc6-by-(24--)-csch-Z^-is.8. contrsc. •:- 

tion, so is their strong limit Z. 

It remains to show that the generator of Z is G. Apply (15’）to Z„: 


. Z„ 工心 : — 

Suppose that x lies in D(G); taking the limit n oo, we get, using (23)，that 

Z(t)x - x = f Z(s)Gxds. 

JO 

Denote by H the generator of Z(r). Dividing the above by t and letting / -» 0, we 
conclude that D(H) includes D(G) and that H = G on D(G). In other words, H is 
an extension of G. However, since, by theorem 4, 入 > 0 belongs to the resolvent set 
of both G and H, H cannot be a proper extension of G. □ 

Another proof due to Hille is based on replacing the differential equation (14) by 
the backward difference equation 

. mizJpLi!i}i =Gm)x . 

Solve this for W(/);. 

W(0 = (I-/!G) _1 W(f ~h). 

Set h = t/n, and set 1 = h,2h, ..we obtain recursively 

W(0 = (I - : G) . (27) 

' ! 

We denote these operators as Z, ? . We claim: 

(i) Each Z„(r) is a contraction. 

(ii) Z„ converges strongly to a semigroup whose generator is G. 


(i) is an immediate consequence of (20). 
1 in section 34.3. 









34.3 THE APPROXIMATION OF SEMIGROUPS 


The result described in this section is of considerable importance in analyzing the 
convergence of discrete approximation to the solution of partial differential equa¬ 
tions. We will first state it in the language of semigroups. We start with a strongly 
continuous one-parameter semigroup of operators Z(r), as in the previous sections, 
with infinitesimal generator G. We discretise time into integer multiples of a small 

unit A, imagined to tend to zero eventually, and define recursively discrete approxi¬ 

mations w ⑻ to Z{nh)x as follows: 

= C/, u (n) , « (0) = X, (28) 

Ch a bounded operator depending on h. The function u(t) — Z(t)x satisfies the 
differential equation 

^ = Gx. (29) 

Therefore — u n )/h ought to be an approximation to Gu. Computing this dif¬ 
ference quotient from (28), we are led to the following condition for the operator Cf t1 
called consistency: 

h|(¥- G ) M H =o (3o) 

for u{t) = Z(t)x, uniformly for all 0 < f < 1, for a set of a: that is dense in the 
domain D of G. 

The recursion (28) can be solved inductively to yield 

u M = Cl X (31) 

as an approximation to Z{nh)x. We impose now a second condition, callQd stability; 
that requires the approximations to depend boundedly on the initial vectors: 

|C；J < constr for nh < l (32) 

and for some constant independent of n or h. 

The following result is called Lax’s equivalence theorem: 

Th 的 re 执一一 gly~rnmimmi rdim=pnmmeter semigroup acting on - 

a Banach space X. Let (28) be a discme‘approximatioirto-- 取作 : satisfyin^^^^ 
sistency condition (30). Then u9]1.j= Cj x x tends to Z{t)x as nh tends to t, for all x 
in X， iff the scheme is stable in the sense of condition (32). 

Proof, The necessity of condition (32) follows from the principle of uniform 
boundedness; see theorem 7 of chapter 15. To prove sufficiency，we use the fol¬ 
lowing version of a high school algebra identity, valid for noncommuting operators 
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A and B; 

- A ri - B" (A-B) + A»- 2 (A - B)B + ... + (A - B)B , '- 1 . 

We take A = C/!，B = Z(/7.)，and let both sides act on a vectors: 

...-—. cu ... 

= 2 C' ； - J -\c h - Z(/z))h(j/i); (33) 

0 

——here we have used the semigroup property, and the notation u(t) = T^f)X7~ 

By the definition of infinitesimal generatbr, for every a* in the domain of G ， 

Z(h、x = x + hGx + 办， 

where ^ is a vector whose norm |5/ ? | is o(h). So, for any t, 

Z(h)u(t) = Z(h)Z(t)x = Z(t)Z(h)x 

= Z(t)[x + AGx + 私 ] =w ⑴ + hGu{t) + Z(r)^/,. 

We use this expression of Z(h)u to write 

(C/ 7 — Z(/i)) u = (C/, —I — hG) u + Z(f )sij. 

Setting this into (33) gives 

Gj t x — Tj(jxh)x = 一】 [C/z - I — hG]u(jh) + ^ ^Zj(jh)si } . (330 


Using the consistency condition (30)，the estimate \sh | = o(Ji), the stability condition 



h tends to zero. This proves that C^x tends to Z(/)x, as nh tends to t and h tends to 
zero, for all x for which (30) holds and for r < 1. The set of such x is dense in X\ 
since the operators C^J and Z(t) are uniformly bounded, the conclusion holds for all 
x. The extension to / > 1 is obvious. : 口 


The equivalence theorem is the framework of finite difference and other discrete 
approximations for solving time-dependent partial differential equations; see Lax 
and Richtmyer. There C/, is a difference operator in the space variables, G a partial 
differential operator; consistency is verified, in the class of smooth functions* by 
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Example L Suppose that Z(t) is a semigroup of contractions, and let us choose for 
C/ 7 the backward difference operator (27) employed by Hille in the proof sketched 
at the end of the previous section: 

C/, = (I-/iG)-'. (34) 

We claim that this choice of Cj t leads to a consistent and stable difference scheme. 
Consistency follows from this string of algebraic identities: 

义二 ! _ G = (I - hG)- ] (I 一 (1 ~ /z(?) - G = (I - /jG)~'G-G 
h h 

=(I — /iG) _1 [G - (I- hG)G] = h(l - /iG) -l G 2 

= (/r l I-G) _l G 2 . 

Clearly, if x belongs to the domain of G 2 , then it follows from estimate (20) that the 
consistency condition (30) is satisfied. Since we have shown in theorem 4 that the 
domain of G 2 is dense, consistency follows. 

Exercise 7. Prove, using (20), that the scheme (34) is stable. 

Theorem 8 is only one of several approximation theorems for semigroups. Other 
versions are due to Trotter, Kato, and. ChemofF (see sections 7 and 8 of chapter 1 
of Goldstein, and Strang). A useful consequence of Trotter’s theorem is Trotter's 
product formula* 

Suppose that G andYL are generators of strongly continuous semigroups T and S 
of contractions, and that the closure c?/ G + H also generates such a semigroup Z. 
Then as nh /i 一 》 0 ， 

lim [T(/i)S(/i)f x = Z(t)x (35) 

for all x in X, 

The significance of this result is that many infinitesimal generations of physical 
processes have a natural decomposition as a sum. Also in many problems it is ad¬ 
vantageous to compute approximately T(h) and S(/i) by entirely different methods. 
Strang has pointed out that 

[t Q) swt J .v=t f ^ mmh)r~ l s(h)T f ^ ^ _ 

is a much better approximation to Z(nh)x than [T(/!)S(/z)]'\:. 

Exercise 5. Why? 


It is natural to ask if it is possible in theorem 8 to dispense with the hypothesis 
that G is the generator of a strongly continuous semigroup, and instead deduce this 
from the existence of a stable difference scheme (28) that is consistent with G. In the 








context of partial differential equations, this would amount to proving the existence 
of solutions of partial differential equations by approximating them with solutions of 
difference equations. We offer only a weak abstract result; for simplicity we take X 

Ibra..HilbBTT§pe~6:. . ... 

Let G be a densely defined closed operator whose adjoint also is densely defined. 
A weak solution of the equation :- - - - …-… 



is defined as follows: 
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We introduce the Hilbert space H as the completion in the "-norm of continuous 
vector-valued functions w{t)\ the //-norm is defined as 

inii 2 w = C \\wm 2 dt. 

Jo 

Given a solution of the difference equation (38)，we extend it as a function of t by 
setting 

m/ z (0 = w ⑻ for nh <t<(n+ l)h. 

Clearly, the /f-norm of iih is 

N 

II 叫临 ="1>, 2 , Nh = \. 

0 

The stability condition implies that ||« (/1) || < const, for all n, nh < 1; it follows 
that \\uii\\h S const, for all h. We appeal now to the weak sequential compactness 
of bounded sets in Hilbert space (see chapter 10) and conclude that we can select 
a subsequence of A — 0 so that [u^} converges weakly to some limit u in H. We 
claim 

Theorem 9. The weak limit u is a weak solution of (36). 

Proof. To show this, take any admissible test function wit), and take the scalar 
nrnduct of = w(nh) with f38). MultiDlv bv h and sum with resoect to n: 


$ 卜⑷， 


Sum by parts, and use the adjc 

.get... 



r ' Ndvrreplacein (40)' the funct: 
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(40’) tends to _ . .. 一 . . . . . 

J ( 是 + G*iu, dt + (w(0),x) = 0. 

This proves that u is a weak solution of (36). 


□ 


REMARK 1. The preceding argument shows that the weak limit lies in H\ that is, it 
is square integrable. It is easy to show that it is in fact bounded. Take any interval 
[o7b] in [Orrj；~according to^ie~stabifity condition, .'|| u || 2 < const, for aH'/j. It 
follows that 


\\in t (t)\\ 2 dt 


一£ —■N 负姐 |卜 2 -1(&•— £7>eOflStv …- 
• a<nh<b - 

The L 2 -norm is lower semicontinuous under weak convergence, so 

f \\u(t)\\ 2 dt < liminf f \\uij\\ 2 dt < (b — a)const. 

Jo Ja 

This proves that ||w(OII 5 const, for almost all t. 


Remark 2. The consistency condition (39) can be relaxed by requiring (39) hold 
only on some dense linear subspace W of the domain of G*. This requires a corre¬ 
sponding change in what test functions are admissible. 


Remark 3. A famous theorem of Friedrichs shows that for first-order partial differ¬ 
ential operators, a weak solution u is a strong solution, that is, an lr limit of genuine 
solutions. I don’t know of any comparable abstract result. 


34.4 PERTURBATION OF SEMIGROUPS 

Rellich has shown (see theorem 5 in chapter 33) that if one adds to a self-adjoint 
operator A another symmetric operator not too large compared to A. then the sum 
马 Iso is self-adjoint. A similar result holds for generators of contraction semigroups. 
Before we state it, we explain a reformulation due to Lumer and Phillips of the Hille- 
Yosida condition on the generators of contraction semigroups, For simplicity we state 
it only for semigroups acting on a Hilbert space. 

Lemma 10 (Lumer-Phillips). Let G denote a densely defined operator on a Hilbert 
space H whose resolvent set includes R+. Then inequality (20), necessary and suffi¬ 
cient for G to generate a semigroup of contractions, is equivalent to 


for all x in the domain of G. 


Re (a% Ga*) < 0 


( 20 ') 
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An operator satisfying (200 is called dissipative. 

Proof. Condition (20) asserts that for all u in H and all 人 > 0 ， 

Denote ( 入 I 一 G) 一 1 w as Then the inequality above can be rewritten as 
(x y x) < - G.v, \x - G,r). 

Expanding the right side，cancelling (x, x) on both sides，and multiplying by A, re¬ 
arranging terms, we get 

(,r, Gx) + (Gx, x) < —||Gx||~. 

Letting X tend to oo we obtain (200. 

The converse can be shown by running the proof backward. □ 

The following result is due to Trotter: 

Theorem 11, Suppose that G is the generator of a semigroup of contractions on a 
Hilbert space H. Let H be an operator with the following properties: 

(i) The domain of H includes the domain of G. 

(ii) Ris dissipative, 

(iii) There exist numbers a and b f a < \, such that 

liavll <fl||av||+fc||.r|| (41) 

for all x in the domain of G. Then G + H, defined on the domain of G, gener¬ 
ates a semigroup of contractions . — 一一 

Proof. Since G is the generator of a strongly continuous semigroup, it is a closed 
operator. We claim that so is G -f H. To see this, let be a convergent sequence 
such that (G + = y n also converges. We_write_G.Y,! = y n — ELr, 7 and form the 

difference 

GCx ;I - x m ) = y n - y m - H(a." — x m ). 

Using inequality (41) on the right, we conclude that Gx n converges, and conse¬ 
quently also HLr /2 converges. Since G is closed, Gx n Gx where x n x. So 
x belongs to the domain of G. That HLv /z converges to Ha* follows from (41). 



We claim that 入 sufficiently large positive belongs to the resolvent set of G + H. 
- -First we show that for every a* in the domain of G, 

ll^ll<^ll(n-G + H)A-||. (42) 

According to the Lumer-Philiips lemma, G, being the generator of a semigroup of 
一一 eemtmetionsr:is:dissipative:: H4s dissipative by hypothesis; therefore so is their sum. 
•… -Inequalky-(42) follows-from the converse-part-ef the-Lumer-Phillips lemma. 

If follows from (42) that the range of G + H — AI is closed. To show that it is 
the whole space we argue indirectly; if it were not, there would a nonzero vector v 
perpendicular to the range^for_all x in £>(G), 

((G + H -入 I)Jt ， u)=0. (43) 

Since G generates a semigroup of contractions, G — 入 I is invertible; so there is an x 
in the domain of G such that 

(G — XI)x = v. (44) 

Setting this into (43)，we get 

M 2 + (Hx ， u) = 0. 

Using the Schwarz inequality to estimate the second term, we get 

llvll < (45) 

Using (44) to express v, and inequality (41) to estimate the right side of (45)，we get 
l|Gx-；^ii < a\\Gx\\^b\\xl 
Square both sides; using the fact that G is dissipative, we get 

||Ga-|| 2 + ^ 2 ||^|| 2 < fl 2 ||Gx|| 2 + 2ab\\Gx\\\\x\\ + b 2 \\x\\ 2 . (46) 

Since a < 1, for X large enough we conclude that ||a*|| = 0. That makes a* = 0 and 
u == (G — 入 I)a : 〒 0; this proves that the range of G + H — AI is the whole space. 

This, combined with the Lumer-Phillips lemma, shows that G + H generates a 
semigroup of contractions. □ 

A statement and proof of Trotter’s perturbation theorem in a Banach space setting 
is given in Goldstein. 


34.5 THE SPECTRAL THEORY OF SEMIGROUPS 


We saw at the beginning of this chapter that when the generator G is a bounded 
operator, the semigroup is the exponential of G: 





THE SPECTRAL THEORY OF SEMIGROUPS 435 

Z(0 = (47) 

by the spectral mapping theorem, theorem 4 of chapter 18 ， 

cr (Z(fJ) = e f<r(G) . (48) 

When G is unbounded, (47) holds only in a symbolic sense. The question is: does 
(48) hold? 

It is easy to show that if y is an eigenvalue of G, then e tY is an eigenvalue of 
Z(t). To see this, let u be a corresponding eigenvector: Gu = yu. Then 

^- e - yl Z(t)u = Z(f)(G - yl)u = 0, 
at 

which means that e~~ yt Z(t) is independent of t. Since its value at t = 0 is u t it 
follows that e^ yr Z(t)u = u for all t. This shows that e yt is an eigenvalue of Z(t). 

Ralph Phillips has shown that the same conclusion holds for any y in the spectrum 
of G: 

Theorem 12. Z(r) is a strongly continuous one-parameter semigroup of operators ， 
G its generator. Then 








By (9), y belongs to the spectrum of G belongs to the spectrum of 

— )• 

"— 0^(0) =1 石一 一 r :. (51) 

According to Gelfand’s theory (see theorem" T4 of chapter 18)lhe spectrum of R is 
.—〒 a,(R)^4p(R))^.pj^gesm.&t.^ this 

…-“ -with (51), we corcJ 止 d_e_lhaLfor.eYery4LkurJ ： G)Ltliereis a„p- 丄 Ad .CsuclithaL_ 


p(R(r)) = (?-y) -1 . 

(510 

- I:et- ： /?-act.on (50); since~pts*tnultiplicative, we get 

—— 

p(R)p(V(t-bs)) = pm))p(V(s)). 

(52) 

It follows from (510 that p(R) ^ 0. We define 


m(t) = (pY(t))/p(R) 

(53) 

and rewrite (52) as 


m (t + >?) = m(t) m(s). 

(54) 


We have shown above that V(0 is a continuous function of r in the uniform topol¬ 

ogy; the homomorphisms are continuous in the uniform topology. Combining these, 
we conclude that p(V(r)), and therefore m(t), is a continuous function of M -» C. 
As is well known, all continuous solutions of (54) are of the form 

m(r) =〆，. (55) 

Apply p to (49) to get p(V(t)) = p(R)p(Z(t))\ combining this with (53) and (55) 
gives 


Now multiply (16) by R: 


p(Z(t)) = e K! . 



e~^ s RZ(s)x ds. 


(550 


As shown above, RZ(<y) is continuous in the uniform topology; therefore the integral 
above exists in the uniform topology 
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Apply p to both sides and use (55 ; )： 

p(R) 2 = f p(R)p(Z(s))ds = p(R) f e^ s e KS els =——. 

JO JO ? 

Dividing by p(R), we get p(R) = (( 一 /c) 一 1 .Comparing this with (51’)，we conclude 
that«: = y. Setting this into (55’）gives 

/7(Z(0) = ^. (56) 

According to Gelfand’s theory, theorem 14 of chapter 19, p(Z(t)) is in the spectrum 
of Z(t). Since y is any point in the spectrum of G ， we conclude from (56) that 
a(Z(t)) contains e ta ^ G \ as asserted in (48’). □ 

The inclusion in (48’）is in some cases proper. Phillips has, however, shown 

Theorem 12 ; . Let Z(t) denote a strongly continuous one-parameter semigroup of 
operators，G Its infinitesimal generator. Suppose that for some T > 0, Z(f) is uni¬ 
formly continuous for t >T. Then every nonzero point in the spectrum of Z(t) is of 
theforma(Z(t)) = e ta ^ G \ 

Proof. The spectrum of Z(t) is the range of the homomorphisms p : A C. 
Applying p to Z(s + 0 = Z(s)Z(t) and abbreviating p(Z(t)) as n(t), we get 

n(s H- t) = n(s)n(t). 

Since Z(t) is assumed uniformly continuous for t >T, it follows from the definition 
that n(t) is continuous for t >T. The only such solutions of this functional equation 
_ 0 are the exponentials: n{t) = e vt . The rest of the proof proceeds as that of 
theorem 12. . . □ 

Theorem 13. Let Z(t) be a strongly continuous one-pcirameter semigroup of oper¬ 
ators, G its infinitesimal generator. Suppose that for some T > 0, Z(7) is compact. 

(i) the nonzero part of the spectrum of Z(0 is ofthefonna(Z(t)) = e /cr ( G ). 

(ii) the spectrum of G consists of discrete points {yy}, Re y\ > Re y 2 > • • •» Re 

—oo. 

~~(iii j for every vector x, 2 (/ )x has an asymptotic expansion for large t of the form 

Z ⑴ x 〜心叩 pj(th. (57) 

where pj are polynomials in t whose coefficients are generalized eigenvectors 
of G. . .. 


Exercise 9. Prove theorem 13. 
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We turn now to the question of the transpose of a semigroup and of its generator 


- 抑足 m&e, ZLO. - X x a Strongly contin- 

uous one-parameter semigroup of operators. Then its transpose Z f (t) \ X 1 X 1 is 
likewise a strongly continuous one-parameter semigroup of operators, generated by 
the transpose of the generator ofZ(t). 


-- P/w/ By definition of the transpose^- - - - 

(Z(t)x,i) = (x,Z'(t)e) (58) 

- 二二 X' From thfs and the reflexiVSy"oTX we deduce thaF 
Z\t) is weakly sequentially continuous. But then, by theorem 5, Z ; (r) is strongly 
continuous. 

Denote the generator of 7J{t) by H. that of Z(t) by G. Choose x to be in D(G), 
i in D(H). Differentiate (58) with respect to t and set / = 0: 

(Gx,£) = (x,m). 

Comparing this with the definition of G'，we conclude that G’ is an extension of H. 
By theorem 4, all k > k belong to the resolvent set of H; the resolvent set of by 
theorem 3 the same as that of G, also contains these points. But then G f cannot be a 
proper extension of H, and thus G’ is the generator of Z’(，). □ 
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GROUPS OF UNITARY 
OPERATORS’— 


The.mathematical landscape is full of groups of unitary operators. The ones we will 
consider in this chapter, strongly continuous one-parameter groups U(t )、-oo < 
r < 00 , come mostly from three sources: processes where energy is conserved，such 
as those governed by wave equations of all sorts; processes where probability is 
preserved, for instance, ones governed by Schrodinger equations; and Hamiltonian 
and other measure-preserving flows. 


35.1 STONE’S THEOREM 

Theorem 1. Let A. be a self-adjoint operator acting on a Hilbert space H • 

⑴ There exists a strongly cominuous group U(r) of unitaiy operators whose 
infinitesimal generator is / A. 

(ii) Conversely, every strongly continuous group of unitary operators is generated 
by i A, A some sdf-ddjdiiii operaidi: 

Proof. (i)We saw in theorem 5 of chapter 32 that every nonreal complex number 
z belongs to the resolvent set of any self-adjoint operator A, and [see (13) in chapter 
32] that the resolvent is bounded by 

IIR ⑵ II 5 Ilnur 1 . 

It follows from this that both /A and 一 /A satisfy the hypothesis of the Hille-Yosida 
theorem，theorem 7 in chapter 34, so both /A and 一 /A generate strongly continuous 
semigroups of contraction; denote these by U(/) and V(/). We claim that V(/) and 
U(/) are inverses. To see this, take any x in the domain of A and form 








B ； :4詞 


丄 ne spectnu resoiuuon oi a seii-aajuiiu um 
defining the functions 05 (A), where cs is the 
set 5. The three constructions we gave in chap 
A all started with a more limited functional ci 
the resolvent (A - zl)" ! , s nonreal; in section 
(A — /I)(A + il) -1 ; section 33.3 was based on 
Stone’s theorem can be interpreted as defining 
sketch how to build out of this functional calcul 





















442 


GROUPS OF UNITARY OPERATORS 


Lemma 2. Let be a strongly continuous one-parameter group of unitary oper¬ 
ators acting on a Hilbert space H. Let u be any vector in HjJhmJke- 細 : ctkm _ 

is positive definite in the sense of Bochner; see chapter 14, section 14.4. That is, a(t) 
is skew symmetric, continuous, and 

. . .—- . (f j 2 -0 _ 

for all choice on R and all complex numbers 0! ， ... ，於 / </. 

•― Proof ： By the definition (30) s the group property.. of_J7(0r.and_the„^LJtoat 
U (- 0 = U(f)- 1 = 卿 )， we have ^ 

- = y^(U(Q- 一 4*)w, u)<pj^k = 

=X>U(4) U ) 


clearly a nonnegative quantity. □ 

According to Bochner 5 s theorem, theorem 8 in chapter 14, a positive definite func¬ 
tion is the Fourier transform of a nonnegative measure; so the function (2) can be 
represented as 


mt)u,u )： 


J e ia dm(k). 


(3) 


Setting r = 0, we obtain ||«|| 2 = m(R). The measure m depends on the vector w; it 
is the Fourier transform of (U(/)w, u) and is therefore uniquely determined by u. 

The right side of (2) is a quadratic function of u\ we associate with it a skew 
bilinear function 


(U(r)H, v) = / e ITA dm(X; m, u), 


⑷ 


where the measures m(w, v) are formed out of the measures m{u) by polarization. 


Lemma 3. The measure m(w, u) has the following properties: 

(i) m(«, v) = m(u). 

(ii) m(v ， u) = m(u ， v). 

(iii) ni(u,v) depends linearly on u, skew linearly on v. 

(iv) \m (S, w, v)\ < \\it\\\\v\\forany Borel set S. 


Proof. The proof is the same as of theorem 17 in chapter 31. 


□ 







We appeal now to theorem 1 of chapter 31 on bounded sesquilinear forms to 
conclude that for any Borel set 5, 

m(S, u r v) == (E(5)m, v). 


where E(5) is a bounded, symmetric operator, ||E|| < l. So we can rewrite (5) as 


(U(r)a, v )= 



(5) 


We claim that E is the resolution of the identity for the operator A, meaning that 
it has properties stated in theorem 1 of chapter 32. The proof is analogous to the 
arguments used in chapter 31, section 31.7, on the spectral resolution of a single 
unitary operator. We leave it to the reader to complete the details. □ 


35.2 ERGODIC THEORY 


Strongly continuous groups of unitary operators serve as a good setting for the mean 
ergodic theorem. We present the abstract theorem in this section, the connection with 
ergodicity of dynamical systems in the next section. 

Theorem 4 (von Neumann). Let U(0 be a strongly continuous one-parameter 
group of unitary operators mapping a Hilbert space H onto itself, 

(i) Denote by F the set of all vectors f in H that remain fixed under the action 
of the group: U(0/ = f for all t. Then F is a closed linear subspace of H. 

(ii) Denote by M(0 the averaging operator 

1 f r 

M(t)g = 7 / UCj)(6 ) 

t Jo 

Then as t oo, M(f) converges strongly, meaning that for every g in H, 

_ _ —.... s: = Pg 

exists, and the strong limit P is the orthogonal projection onto F. 

— £rflc?/JIhe-originaLpiQaLQf-voii_Meumann reJled.oiLt:he_spectraLresoludoiLof-iin=. 

•—bo 胆 ded .self: 马 djgint .opex 印 ： ors. The simple proof presented here is due to Eberhardt 
Hop£ 

That F is a closed linear subspace is obvious, for the nullspace of the continuous 
operator U(f) — I is closed and linear and so is their intersection F. For part (ii) we 
need first 


Lemma 5. Let V be a unitary operator, E the nullspace ofV — I, and R its range. 
We claim that E is the orthogonal complement of R. 










Clearly, as f oo, the second term on the right tends to zero, and the first term 
tends to 



Since U(j) commutes with U(r) — I, it maps its nullspace E into E. Therefore (8) 
belongs to E. 

According to (7), every g in H can be decomposed as e + z\ it follows that the 
limit M(0 尽 as f — oo exists for every g in H, and that the limit belongs to E, the 
set of vectors fixed under U(r). Since r is arbitrary, it follows that the limit belongs 
to F, the set'of vectors fixed under the action of every operator of the group. 

The operators U(j) act as the identity on F; we claim that they map the orthogonal 
complement of F into the orthogonal complement of F. To see this, suppose that w 
is orthogonal to F\ the relation 

(V(s)w 1 f) = (w. U* ⑷ /) = (u ;， V(-s)f) = (w, /)-0 

shows that then U(^)iy is orthogonal to F. Since the operators M(0 defined in (6) 
are averages of the operators U(^), they also act as the identity on F and map the 
orthogonal complement of F into itself. The same is true of the strong limit 
Since we have shown that this strong limit maps H into F, it follows that the strong 
limit maps the orthogonal complement of F into 0. This shows the 5 — lim f —oo M(r) 
is orthogonal projection onto F. □ 


35.3 THE KOOPMAN GROUP 

Let M be an open, compact differentiable manifold, with some prescribed volume 
element V. We wish to study volume-preserving flows along vector fields D, namely 
solutions of the differential equation 

dx • 
dt 

Among such flows are those along Hamiltonian vector fields. Denote by x(y; t) the 
position at time t of that solution of (9) whose value at time zero is y; the mappings 
y .v(v ； t) are volume preserving. Since MHc6ifT_f7TtTV5nfiiiFT§"friYrtm6' 
石 ltrZTiFilTd^p^iitdennjf if rtheTefore"* 1 ' : 

. 5).0 = -v(2 ；5 + r). (90 

Bernard Koopman has associated with every such flow a one-parameter group of 
unitary-operators, acting on the Hilbert space of square integrable functions g on M: 


(U ⑴ g)00 = 


( 10 ) 
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It follows from the volume-preserving character of the flow that the operators U(r) 
defined above preserve the L 2 -norm; in fact they preserve all the L^-norms, 1 < 

What does the mean ergodic theorem, theorem 4, say about these flows; that is, 

. what can we sayjiere about tibe space F of functions that remain fixed under the op¬ 

erators (10)? Obviously..every constant function remains fixed; are there any others? 
Ler/ Be' slich' a function; we may take / to be real. Let c be any real number, the 
serS c of points-jin M-where /( 7 )-<rc is then invariant-\xx\Atx the flow; that is, if 

•v belongs to S c , so do all points x(y; t)Af / is nonconstant, there is a value of c for 

which the invariant set S c is nontrivial in the sense that neither S c nor its complement 
—in M has measure zero. Converse ly，if there is a nontrivial set S invariant under the 
flow, its characteristic function 

,,,11 ifyinS 
/0，)= (o if not 

remains fixed under the group (10). So we have shown: 

Only the constant functions on M are fixed under all operators of the Koopman 
group associated with a given volume preserving flow on M iffM has no nontrivial 
measurable subset that is invariant under the flow. 

A flow that has no nontrivial measurable subsets is called metrically transitive. 
Suppose that there are no nontrivial invariant sets under the flow (9). Then F 
consists of constants, and according to the mean ergodic theorem, for every function 
g in L 2 (M), M(t)g tends to the projection of g into the space of constants. What 
is that projection? Clearly, it is the mean value of g over M with respect to the 
prescribed volume: 


P8 = ^mL sWdv - ⑼ 

The projection (11) is called the space average of the function g; the limits of the 
averaging operators M defined by (6) are called time averages of g. So, loosely 
speaking, the mean ergodic theorem applied to the Koopman group asserts that the 
space average and the time average of an arbitrary L 2 function are equal, provided 
that the flow (9) in question has no nontrivial invariant subset. 

In statistical mechanics the manifold M is phase space, the vector field derived 
from the Hamiltonian of N interacting particles, N ~ 10 23 . The time average is inter¬ 
preted as the measured value of a function g of N variables; the time of measurement 
is large on the scale relevant in theorem 4. The significance of the ergodic theorem, 
first proposed in a rudimentary form by Ludwig Boltzmann, is that instead of having 
to solve a differential equation of form (9), involving about 10 23 unknown functions, 
we merely have to evaluate an integral (11) over a 10 23 -dimensional manifold. 

Jack Schwartz has pointed out that the functions g whose measured values have 
thermodynamic significance are highly symmetric functions of their 10 23 variables. 
The equality of the time and space average of such highly special functions might 
well be due to reasons other than the ergodic theorem. 
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remark that in general, it is very hard to decide which flows have nontrivial 
mt subsets and which don’t. An amusing example is given in Lax. 

□HE WAVE EQUATION 



[« ⑼， u t (o)} _)， 
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It is not hard to show (e\g., by 'taking the spatial Fourier transform) that the initial 
value problem, that is, the problem of finding a solution in full space-time of the 

wave eguation with prescribed initial data u(o) and W/ (o), can be solved if the initial 

data are sufficiently smooth and have compact support. So the operators U(f) are 

well defined on such data.. Since these data form a dense subspace of all data H, 
we can, "by continuity, defirie'tfie operator U(r) on all of H. It is not hard to show 
二 fthBi the extended opemtors form a strongly continuous group of unitaiy operators. 
—— Strong-continuity- can be.-veri.fied Jor-smooth solutions with smooth initial data, and 
then extended by continuity to all initial data with finite energy. The group property 
UO + ;•) = UCs)U(r) expresses the fact that if w(x ， y ， z, 0 is a solution of the wave 
equation, so is u{x, ,v, z, t — s). Unitarity is merely a restatement of the.conserva¬ 
tion of enefgy7 plus the facTEIiat each operator U"0T)~is Invertible; the inversion* fs' 
accomplished by solving the initial value problem backward. 

Exercise L What is the infinitesimal generator of the group formed by the solution 
operator of the wave equation? 

We describe now an important extension: the study of solutions of the wave equa¬ 
tion for all time but not in the whole three-dimensional space, only in the exterior of 
some obstacle B. On the obstacle all solutions are required to satisfy the boundary 
condition: w = 0 on the boundary of B. 

The conservation of energy is derived the same way as before. This time integra¬ 
tion by parts produces the boundary term 

/ u t u n dS, u n = normal derivative of m, 
dB 

which is zero for all u that vanish on the boundary. Note that we could also have 
chosen to impose the boundary condition u n = 0, to make the boundary term vanish. 
In either case the laws of conservation of energy follows. 

We then proceed as before to construct the solution operators U(0, which form 

a strongly continuous group of unitary operators in the energy norm. The only new 

complication is that it is harder to prove the existence for all times of solutions of 

the wave equation with prescribed smooth, initial data, and satisfying the boundary 

condition. But this is only a technical difficulty, which should not be allowed to 
obscure the simple underlying structure of solutions with finite energy in the exterior 
of an obstacle. 


35.5 TRANSLATION REPRESENTATION 

The spectral representation of a self-adjoint operator A acting in a Hilbert space H 
is described in chapter 32. When A has spectrum of multiplicity 1, there exists a 
nonnegative measure m on R and a unitary mapping between H and L 2 (E, m): 

H —^ L 2 (l, m) 



























By taking a Fourier inverse, we can obtain from (13) another representation 

•• ~ (130 


That is, if the vector u in H is represented in (13) by the function /( 入 )， then (130 
assigns to u the representation 



It follows from theorem 6 that (exp ikt)u is represented in (13) by txp(iXt)f(X), 
and so in (13 7 ) by k(x — t). For this reason (13’）is called ^ translation represen- 
:tatio.n_s£...H for the unitary-group generated-by~/-A. Conversdyr^FO 扭 ■ 去 扭瓶 tetion 
representation we can construct a spectral representation by Fourier inversion. 

We turn now to a geometrical characterization of translation representations due 
to Sinai. Let H be a Hilbert space, U(r) a strongly continuous one-parameter group 
of unitary operators on H that has a translation representation. Denote by F the 
subspace of H consisting of vectors represented by functions supported on R 一： 


F > L 2 (N,R-). 


(14) 


Clearly, U(r)F consists of vectors represented by functions supported on (—oo,r). 
It follows then that the one-parameter family U(r)F increases as r does，going from 
{0} to H as r goes from 一 oo to cx>. We express this more precisely thus: 

U(r)F C F forr <0, (15a) 

nU(r)F = {0 }， (15b) 

UU(r)F = H. (15c) 


Theorem 7. Conversely, let H be a Hilbert space, U(0 a strongly continuous one- 
parameter group of unitaiy operators mapping H into H. Let F be a closed sub¬ 
space of H t and suppose that all three conditions (15) are satisfied. Then H lias a 
translation representation (13 f )for\i{t) where F is given by (14). 


Exercise 2. Show that U(^)F C U(f)F when s < t. 

Sinai deduced this result from von Neumann’s theorem about the Heisenberg 
commutation relation. Phillips and the author gave an independent proof and showed 
how to deduce from it the result of von Neumann; we will present that in the next 
section. We give here the proof due to Phillips and Lax; it is a little technical, but 
pretty, at least in a parent’s eyes. 


from another representation theorem: 








TRANSLATION REPRESENTATION 


to 0 strongly as t tends to oo; 

lim Z(t)k = 0 (: 

S—OO 

for all k in K. Then K can be unitarily represented as a closed subspace 
M_), N some auxiliary Hilbert space, so that the action ofZ(t) is tra, 
lotion to the right by t t restricted to 

Proof. Denote the generator of the semigroup Z by G, D(G) the domain of 
We want to define first a representation for vectors g in D(G), and then extend t 
representation by continuity to all of K. To any g in D(G), we assign the vect 
valued function y (s) as 

y(s) = Z(-s)g, s <0. ( 
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疆獅 ! W Ew^f taWtfi nBfiMiTflBTiTi w 




to F for r > 0. To see this, take any vector / in F; since U(/*) is unitary, U*(r)= 
U- ! (r)=U(-r): — 


(/ ， U(r)p) = (U*(;‘)/ ， p) = (U (— ?•)/ ， p) • 


According to assumption (15a), U (— ，•）/ belongs to F; since p is orthogonal to F, 





<0 such that 11/ — U(r)^|| < e. Therefore, writing s = -r, we have ||U(j)/ — 
|| < e. The projection P maps g into zero. Therefore applying the projection P, we 
onclude that ||Z(^)/|| = ||PU(5)/|| < €, Since the operators Z are contractions, 
Z(0/ll < ^ for all t > s. □ 
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It follows from lemma 9 that theorem 8 is applicable to the semigroup Z(t )= 
PU ⑴ defined on F. There is an isometric translation representation that assigns to 
each / in F a function (/> in L 2 (N, JR 一）： 



f 

<~~ (j){s) 

(23) 

so that for any t > 0, 

pu(o/ ^ 

— c(t)(f)(s-t), 

(230 

where c is the character 

istic function i 

ofK-. The representation is isometric: 



ll/ll 2 = 

r0 

/ 1_ 呤心， 

/-oo 

(24) 


IIPU ⑴ /| 卩 

= [ II0W||^ 

J-oo 

(25) 

We extend the representation to all vectors in H of form V(t)f, f in F, by setting 


U(0/ 

< ~ ^>(s - t). 

P6) 


It follows from (24) and the isometry of U(0 that this assignment is isometric. We 
claim that (26) is consistent with (23 7 )； that is, ifU(0/ belongs to F, then the right 
sides of (23’）and (26) are equal. Clearly, this is so iff the function ^ representing / 
is zero for ^ < 0. That this condition is satisfied can be seen by comparing 
(24) and (25). The left sides are equal; therefore so are the right sides, which can be 
only if 0 == 0 in (—t, 0). 

According to (15c), vectors of form U(r)/, / in F, are dense in H. Therefore the 
representation (26) can be extended by continuity to all u in H: 

u <~> k(x). 

This representation is isometric: • 

厂 CO 

ll«ll 2 = / ll*(-v)|| 7 v dx. 

Joo 

transmutes the action of U into tran«>lnHnn 1 _ 

-f),. 

and the action of P into truncation, 

Pw <—~> c(s)k(s). 

It is not hard to show that the range of this representation is all of L 2 (N,R); see 
Lax-Phillips, 1981. □ 



The conditions under which a unitary group has a translation representation may 
appear somewhat special. Nevertheless, there are natural, interesting, and nontrivial 
examples coming from wave propagation as described in section 35.4. There the 
'undedying Hilbert space H is the set of all initial data {«( 文， 0), 0)} defined in 

M 3 with the energy norm 

.||{«(A ： ),M,(x)}||g = J (J2 u l + u ?) dx - 

The group U(/) is the group of solution operators for the wave equation 
iiti — Au = 0, 

uT7TTti^ ， _ornr/t«7orr=v{w( ： t ， 0 ,吣 . —…‘ 

The unitary character of U(0 expresses the conservation of energy and the re¬ 
versibility of time. 

The role of the distinguished subspace F is taken by the so-called incoming initial 
data. These are initial data of incoming solutions u(x, t), which are zero inside the 
backward light cone : 

u(x, t) =0 for \x\ < —t. 

It is far from obvious that there are any incoming solutions at all. We show now 
how to construct them by relying on Huygens’s principle for wave propagation the 
three-dimensional space. Conceptually this principle says that solutions of the wave 
equation propagate information with speed equal to 1. Technically this means that 
the value of a solution u{y, s), (j, s) in M 3 x M, uniquely determined by the initial 
data («(0), W/(0)}, depends only on the values of w(0) and u t (0) and their space 
derivatives, at the intersection of the light cone with the initial plane, that is, the 
points x satisfying \x 一 y\ = |*s|. 

Let /=={/!， f 2 ) be data that are zero for |a*| > S. We claim that for T > S, 
U(—T)/ is incoming. 

Exercise 3. Show, using Huygens’s principle, that U(—T)f is incoming. 

We show now that the space F of incoming data has all three properties listed in 
(15a) to (15c): 

(i) Let u denote an incoming solution, / its initial data. The solution whose 
initial data are JJ(r)f is w( r )(x, f) = w(jc, t + r). Since u is incoming, 

»(A*,r) = 0 for |a-| < —r. 

Therefore 

w ⑻ (a 、 r) = 0 for |jc| < -(/ + /•). 


When r < 0, u r is incoming; this shows that U(r)F C F for r < 0. 
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(ii) The relation above shows that u {r) (x t 0) = 0 for |.t| < -r and, after differ¬ 
entiating with respect to f, that u\ r \x y 0) = 0 for ||.\:|| < ~r. It follows that 
the intersection of U(r)F contains only the zero data. 

(iii) Let it be any of the incoming solutions constructed above, / its initial data. 
It follows from the construction that U{T)f can be taken as arbitrary data 
supported in |.r| < T. Clearly, the union of U(7)F as r — oo is dense 
in H. 

Huygens’s principle is valid in any space of odd dimension, and so is the analysis 
of incoming data. One could deduce the properties of incoming data from explicit 
formulas for solutions of the wave equation in E ,z x R, but our derivation is more 
illuminating. 

Far more interesting is the case of wave propagation in the exterior of an obstacles, 
discussed in section 35.4. Suppose that the obstacle is contained inside a ball around 
the origin of radius R. We define an incoming solution u{x, t) as vanishing inside 
the cone 

u(x, 0 = 0 for |-r| < —f + /?. 

Note that such an incoming solution satisfies both boundary conditions discussed 
in section 35.4. Define again F as the initial data of incoming solutions. Properties 
(15a) and (15b) can be immediately verified as before, but property (15c) lies much 
deeper (see chapter V of the book by Lax and Phillips). We will return to this example 
in chapter 36, section 36.5. 

Another very interesting example is furnished by the automorphic wave equation 
in hyperbolic space. We will take it up in chapter 37. 


35.6 THE HEISENBERG COMMUTATION RELATION 

In quantum mechanics the state of a physical system is a unit vector u \\u\\ == 1, in 
a Hilbert space H over C associated with the physical system. Each observable is 
identified with a self-adjoint operator, constructed according to the so-called rules 
of quantization. 

Defini tion. Tht expected value of an observable A mstat^wjs defined 
Here we assume that u belongs to the domain of A. 


The U 
servable 
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the right side rewrite (30’）as 

o < (llAwll 2 - a 2 ) - f + (||Bw|| 2 - 6” f 2 = A 2 (A, u) - t + A 2 (B, u)t 2 . 

In words，the quadratic polynomial on the right is nonnegative for all realr. Therefc 
its discriminant is nonpositive: 

1 — 4A 2 (A, z/)A 2 (B, u) <0; 

this is inequality (29). 

What pair of operators satisfy the commutation relations (28)? An elegant arg 
ment of Wielandt shows that no bounded operators do. To see this, we deduce I 
induction from (28) that for all natural numbers n, 

= AB H (28 

Taking the norm of both sides and using the triangle and product inequalities on tl 
right gives 

< 2||A|| ||B ,, || < 2||A|| ||B|| ||B H _ I ||， 

which implies that ||B H — 1 || = 0 for /i > 2||A|| ||B||; so B ， 卜 1 = O. Backwa 
recursion based on (28") shows that then = O for all k. 

On the other hand, the operators A = B = 〆， acting on the Hilbe 

space L 2 (E) of functions /(/x) do satisfy (28)，for 

It was shown by von Neumann that this pair is, except for multiplicity and unitary 
equivalence, the only pair of operators satisfying the commutation relation •— 

- stating-precisely and proving this; we follow Weyl in reformulating (28). Consider 


UCsOBU (- s), (31 ： 

^hwUtJri^thTTOirafygfm^ 

冬 TJ(s) = iAU(s) = fU(^)A. - 
clt 

Differentiating (31) formally and using the commutation relation (28)，we get 
4>U(j)BU (-幻 =/UQ) [AB - BA] U (- 幻 = —I, 
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Integrating this relation gives 

UWBU (- •?) = B - rf ， (32) 

called the Weylform of the commutation relation. It is taken to mean that for all real 
values of s the self-adjoint operators on the two sides of (32) are identical. 

Exercise 4. Deduce from (32) that U(^) maps the domain of B onto itself. 

Exercise 5. Denote by V(0 the unitary group generated by /B..Deduce from (32) 
that for all real s and r. 

. . . V(sTVXt) ^ e ist Y(t)V(s). 

(Hint: Differentiate with respect to r.) 

The next result is due to von Neumann: 


Theorem 11. Let A and B a pair of self adjoint operators acting in a Hilbert space 
H t U(/) the unitary group of operators generated by iA. Suppose that the Weyl rela¬ 
tion (32) is satisfied. Then there is a representation ofH as L 2 {N, R) so that 



Proof. We remarked earlier that Sinai derived the translation representation theo¬ 
rem from von Neumann’s theorem. Since we have given an independent proof of the 
translation representation theorem, we are entitled to reverse Sinai’s proof. 

Let E ( 入 ） be the spectral resolution for the self-adjoint operator B: 

B = j XdE(X). (33) 

Then 

U(j)BU(-j) = J W(U(j)Ea)U(-j)) (34) 

and 

B — jI = J (；.- s)dE(k) = j XdE(X + s). (34 ’） 

The operators on the left of (34) and (34 7 ) are self-adjoint; the integrals on the right 
give their spectral resolution. Since, according to (32), the two operators are identi¬ 
cal, so are their spectral resolutions: for every Borel set T, 


U(j)E(7 , )U(-^) = E(r + ^). 


(35) 





with unprecedented explosive power. Th 
head of the Press Branch of the German 

to the physicist Carl von Weizsacker, ,, _ 

von WeizsaekeFr who on-September-4 T -1-941 rinformed-the-Abwehrv the Intelligence- 
Branch of the German High Command, and Bernhard Riist, Reichsminister in charge 
of the ongoing German uranium project, of which Heisenberg and Weizsacker were 
leading members. A “cultural” visit of Heisenberg and Weizsacker to Copenhagen 

-twweekHate 广 was-arranged-aHheHhigheslrteveh - 

-^It4s:ironieakhat the report irvthe-Swedish newspaper was premature; the American 
uranium project did not start until 1942. By an ever greater irony of fate, Heisenberg 
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36 

EXAMPLES OF STRONGLY 
CONTINUOUS SEMIGROUPS 


36.1 SEMIGROUPS DEFINED BY PARABOLIC EQUATIONS 

In chapter 16, section 16.5, we studied solutions of the heat equation 

ut = Uxx (1) 

that are defined for alU > 0 andalLr, and that decay sufficiently rapidly as |.t| oo. 
We have shown there, see theorem 13, that such solutions are uniquely determined 
for all t by their initial values. The solution operators S(t) relate initial values of 
solutions to their values at time t: 


S(t)u(0) = u{t). 


The results of section 16.5 can be summarized as follows in semigroup language: 

The solution operators S(t) form a strongly continuous semigroup of contraction 

operators in any of the L^(M) spaces, 1 < p < oo t and on the space of contmuous 
functions on R that are 0 at 士 oo. 

It was remarked already in chapter 16 that similar results hold for a far more 
general class of equations than the hearequation: we may replace in (1) the second 
space derivative of u by any second-order elliptic operator in any number of space" 
variables E acting on u: . 


u t = Ew, 


( 2 ) 


E = 9/ dj-^rbi 9/=- , 4* 


--(#)- 


where (ay) is a real, uniformly positive definite symmetric matrix whose entries are 
smooth functions of x. The coefficients Z?/ and c are also smooth functions ofx. The 
whole space may be replaced by a bounded domain in R n , on whose boundary u 
is required to satisfy a single boundary condition； say u = 0. The solution oper¬ 
ators again form a semigroup. The domain of the infinitesimal generator includes 
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all smooth functions that satisfy the boundary condition, and on such functions the 
generator acts as the operator E defined in (2’). A possible approach to proving the 
existence of solutions of (2) with prescribed initial values is to make the proper ex¬ 
tension of the operator E and then verify that E thus extended satisfies the hypotheses 
of the Hille-Yosida theorem. 


36.2 SEMIGROUPS DEFINED BY ELLIPTIC EQUATIONS 

In the first example we take the Banach space C(5 m ), the space of continuous func¬ 
tions u on the m dimensional unit sphere. For each such function u there is a uniquely 
determined harmonic function h = h (rw) in the (m H-1 )-dimensionaI unit ball which 
equals u on the boundary of the unit ball: 

A/z = 0, h(co) = u(co), coinS m . ( 3 ) 

We define the semigroup Z(r) as follows: 

^{t)u = h(e^a)), ⑷ 

where h is the harmonic function defined by (3). 

Theorem 1. 

(i) The operators Z(t) are contractions in the maximum norm. 

(H) 7j{t) forma strongly continuous one-parameter semigroup of operators. 

(Hi) For t > 0 the operator Z(/) is compact. 

Proof, (i) follows from the maximum principle for harmonic functions, (ii) The 
semigroup property is clear. Since if h(x) is a harmonic function, so is h{cx) for any 
constant c; therefore Z(t)Z(s)u = h^e^co) = = Z(*s. + r)w. Strong 

continuity is a consequence of the continuity of harmonic functions in the unit ball 
whose boundary values on the unit sphere are continuous. 

(iii) To prove the compactness of the operator Z(r), r > 0, we have to show that 
the image of the unit ball under Z(r) lies in a compact set. This image consists of 
the harmonic functions on the sphere |x| = e~~ ! which are bounded by 1 in the unit 
ball，According to the Arzela-Ascoli criterion (see the beginning of chapter 22) for 
precorapactness in the maximum norm a set of functions has to be equicontinuous. It 
is a well-known property of harmonic functions that on any compact subset of their 
domain of definition, their first (or any higher-order) derivatives are bounded by a 
constant multiple of their maximum on their domain of definition. Since uniformly 
bounded first derivatives guarantee equicontinuity, it follows that the image of the 
unit ball in C(S m ) under Z(r), r > 0, is precompact. □ 

In our second example we replace the maximum norm by the L 2 (S m ) norm. The 
same results hold, as well as an additional property: 
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Theorem 1’. 

(0 The operators Z ⑴ defined by (3) and (4) are contractions in the L 2 (S m ) nonn. 

(ii) Z(t)fonn a strongly continuous semigroup in the L 2 (S n ) norm. 

(iii) Z(t) is a compact operator for t > 0. 

(iv) The operators Z(t) are real symmetric. 
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/[W 


)k{q)-h{p)k r {q 


)fc.. 


⑻ 


where we have used the fact- that dq/dp -= -^-/-/?rNow-de#ne the-fuflct*iGn-£ by—- 

£(r, co) = k (^ ， . ， co); 

t is a harmonic function in the ball of radius p/q\ and 

e(p ， a>) = k{q,d)), 4 (/? ， co) = - k r (q.co) ‘ 

P 

Settihg this into (8 )， we obtain … " - --_ 

J [hr(p) Hp)- h{p)l r {p)] dco. (9) 

We recall now Green’s formula: 

/ - h M] dx = f [th n - U n ]dS. (10) 

«/G JdG 

Since h and t are harmonic functions, the left side of (10) is zero; therefore so is the 
right side. Now take G to be the ball of radius p; the normal derivatives h n and t n 
are then derivatives with respect to the radius r, and dS = p m dco. The right side of 
(10) is p m times (9). This proves that (9) is zero; therefore (8) is independent of p. 
Letting p tend to ^ and to 1， we deduce that the two sides of (6) are equal. □ 

We show next that the generator G of Z(/) defined in theorem V is self-adjoint. 
To see this, let u and v belong to the domain of G; since Z(r) is symmetric, 

{Z{t)u,v) = (u,Z(t)v). 

Differentiate with respect to t, and set t = 0: 

(Gm ， u) = (« ， Gu), 

proving that G is symmetric. We saw in chapter 32 that every complex number with 
sufficiently large real part belongs to the resolvent set of G. On the other hand, we 
have shown in chapter 33 that every unbounded symmetric operator whose resolvent 
includes points in both the upper and lower half - 1 ’ • ” . 





Phillips’ spectral mapping theorem (see chapter 34, section 34.5) that the spectrum 
also of G is discrete and accumulates only at 一 oo. It turns out that the spectrum of 
G can be determined explicitly: 

Let y denote a point of the spectrum of G; since G is self-adjoint, y is real. 
By the aforementioned spectral mapping theorem, e yt belongs to the spectrum of 
Z(0； since Z(0 is a contraction, y is < 0. Let e(w) be a corresponding eigenfunc¬ 
tion, h(rco) the harmonic function whose boundary value is e(co). By definition (4)， 
Z(t)e = h(e~ t w) = e y ， e(co). Setting = r, we get that 

h(rco) = r^ y e(co). (11) 

It follows from (11) that)/ is a nonpositive integer; otherwise, the function r^ y e(co) 
would have only a finite number of derivatives at r = 0, contrary to the fact that 
harmonic functions in the unit ball are infinitely differentiable (even analytic) at the 
origin. 

Theorem 2. The spectrum of the generator G of the semigroup Z(t), defined in the¬ 
orem l f , consists of all integers < 0. 

Proof. What remains to show is that every nonnegative integer n belongs to the 
spectrum of G. Consider the space P of homogeneous polynomials p(x 0 , … ， x m ) 
of degree n; denote its dimension by d n . The Laplace operator maps such a p into 
the. space of homogeneous polynomial of degree n — 2. By linear algebra, a subspace 
of P of dimension at least cl n — d n -i is mapped into zero; that subspace consists 
of harmonic polynomials. The boundary value e(co) = p(co) of such a polynomial 
satisfies 

Z(t)e = pie^co) = e~~ tu e(co) t 

and so is an eigenfunction of Z(t). Differentiation with respect to t shows that e is 
an eigenfunction of G with eigenvalue —n. □ 


36.3 EXPONENTIAL DECAY OF SEMIGROUPS 

-In-ehapter34—seetioir34Tl7-we-saw-thtit~a-strongly-continuous~one=parametersemi- 
group-grows-at most-exponentially-as t oo. In this section we investigate how fast 
such a semigroup can decay. 

Suppose that the infinitesimal generator G, acting oh a Hilbert space H, is self- 
adjoint and bounded from above. Using the spectral resolution of G we can express 
the semigroup as 
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It follows from this formula that for / ^ 0, Z(/) decreases at some exponential rate 
as / tends to oo. The main result of this section (see Lax) includes a perturbation of 
this result. 

Theorem 3. Let H be a Hilbert space, G the generator of a strongly continuous 
semigroup ‘ Assume that there is an infinite sequence {|； ? } of real numbers, tending to 
—oo, such that the resolvent of G is uniformly bounded by some constant d 一 ' on all 
lines ReA = :知 U l J^ e . com P^ ex plane: 

m^Xir ] \\<d-\ Re 入 = 6. (12) 

— 0 < t f be a vector-valued junction- whose values lie in the domain ofG. 
Assume that Gu{t) is a strongly continuous function of t, and that u is strongly 
differentiable, aud its derivative u t is strongly continuous. We require u{t) to satisfy 
for all t the inequality 

\\ut~Gu\\ <^||«||, 0<t 9 (13) 

where k is some constant less than d. Then, unless identically zero, u{t) decays no 



function of t, and that u(t) has a strongly continuous first derivative with 
.Then 


d 2 f \\u(t)\\ 2 e~^'dt< [ \\Gu - utfe-^ dt, (15) 

m Jr ■ 

provided, that the integral on the left is finite. 

Proof. Define v(t) = then (Gh - u,)e~^ = (G — f )u — u,. I 

transform is (G — 与 — ir)v(t), where v is the Fourier transform of v. 
assumption, ||(G - A)~ ] || < rf" 1 forReA = ^, 

d 2 ||S(T)|| 2 < ||(G-f- !.r)fi(T)|| 2 . 


According to Parseval’s theorem, the Fourier transform preserves the L d 
grating (16) we obtain (15), provided that v is in L 2 . 
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Exercise 2. Prove Parseval’s theorem for functions whose values lie in a Hilbert 
space. 


We turn now to theorem 3. First we extend u(t) for negative values of t, such as 
by setting u(t) = u(0)a(t) for t < o, where a(t) is a smooth function of compact 
support, a(0) = 1. We apply inequality (15) to u thus extended, with % — where 
(12) is satisfied. On the right, we apply for r > 0 inequality (13), and on the left, we 
drop the integral over M_. We get 


d 2 J^ Mt)\\ 2 e- 2 ^dt<K n +k 2 J^ Mt)\\ 2 e- 2 ^ dt, (17) 


where K n is the integral of \\Gu - u t \\ 2 e^ nt over and therefore tends to zero 
as n tends to oo. Since k is less than d, we deduce that 



\\um 2 e- 2 ^ dt < K n /(d 2 - k 2 ), 


(170 


provided that the integral on the left is finite. If, contrary to (14), the left side is finite 
for all it follows from (17’）that u(t) = 0 for alU > 0. □ 


Note that in the proof we only required the resolvent of G — 入 to be bounded as 
in (12), and not that it be defined everywhere. 

Note that for G self-adjoint, condition (12) is equivalent to this: 

The intervals — d，+ d) are free of the spectrum of G. 

So as a corollary of theorem 3 we get 

Theorem 3\ Let G be a self-adjoint operator, bounded from above, whose spectrum 
has infinitely many gaps of width 2d. Let u(t) be a vector-valued function as in 
theorem 3, satisfying inequality (13); then, unless identically zero, u(t) decays no 
faster than exponentially, in the sense of (14), 

The restriction k < d is sharp; an example to the contrary-w-hen-fe-c- d-is-given-in—- 
Lax. 

Here is an application of theorem 3\ Let L be a partial differential operator of the 
form L = A-c, where A is the Laplace operator and c is the operator of multiplying 

- 細班 led - to - be -s m eoth-aid-pe^tiver-The-i^-analegtie-of-the — 

maximum principle-(5) ： holds^for^olutions-otLiu-= - 

Lemma 5. Let w be a solution of Liy = 0, L = △ — c; c positve. Then 
J w 2 (r ， co) dco 


/(r) = 


(18) 
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Proof. DMmntiate {I S) with respect to r; we get 


(180 


Next we use Green’s theorem to transform the following integral over the ball B r of 
radius r:. 


wLw dx 


(wAw — cw 2 )dx= ww,.dco - I (wl-^cw 2 )dx. 
. Js r - J Br . 


Since c is a positive function, the positivity of (180 follows, and so does the lemma. 

□ 

We designate by H the Hilbert space L 2 (*S), S, the unit sphere. We take G to be 
the generator of the semigroup Z(r) discussed in theorem V of section 36.2. Recall 
that Z(t) is defined by equation (4): 


Z(t)u = hie-%), 

where h is the harmonic function defined in (3): 

A/z = 0, h{co) = u(co), coin S. 
Differentiating (4) at r = 0 gives 

Gw = 一 h r . 


(4) 

⑶ 

(19) 


Let w be a solution of Liu = 0 in an open set containing the unit ball. Define u (t) 
to be wie^co). Then at / = 0 


u t = -w r . ( 20 ) 

We now show that this function u satisfies inequality (13) of theorem 3, with k = k(t) 
a function that tends to zero as / tends to 56. 

Let v be any smooth function on 5; define p(x) to be the harmonic function in the 
unit ball whose value on S is v: 

△p = 0， p(co) = v(co). (21) 

Multiply (21) by h and integrate over the unit ball. Since h and p are both harmonic 
functions, Green’s theorem applied on the unit ball B gives 

0 = / ( 卩 r , 7 - Phddco, ( 22 ) 

A similar application of Green’s theorem gives 
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This is inequality (1 
Recall that u.(t) i 
function w(t) = wie^x) satisfies the differential equation 

△u;(r) = e^ 2t (Aw)(e^x) = e^ 2t c(e- f x)w(t). 
Therefore applying (26) to «(/), w'eget 


||Gw — M/|| 5 ^pjC max |MI. (260 

This is inequality (13), with k(t) = e^ 2t c mQX y(m 4 - 1). 

According to theorem 2 the spectrum of G lies on the negative integers; therefore 
it contains infinitely many gaps of length 1. The factor on the right in inequality (26，） 
tends to zero as ? tends to oo. Therefore we can apply theorem 3 f to u(t) = wie^co) 
and conclude, as in (14), that unless u; s 0, there is a constant b such that 
poo 

J o ||«(011 2 ^ dt = OQ. .(14) 

Denote e~ l = r, and x = rco. Since dt = -rdr, and dx = r m dcodr, we can 
rewrite (14) as follows: 


We can restate our conclusion (14’）as follows: 





only in the case when the function c(x) is not analytic. The first such theorem was 
obtained by Carleman in two space variables, and by Mullerforany m;a very general 
result is due to Calderon. ^ 


36.4 THE LAX-PHILLIPS SEMIGROUP 

In chapter 35, section 35.5, we introduced the concept of a translation representation 
of a group of unitary operators U(r) acting on a Hilbert space 丑 • A key role there was 
played by a subspace F , which we will here call an incoming subspace and denote 
as F 一. The incoming subspace, F_. is assumed to have the properties enumerated in 





product on the right in (30) is zero. Since P+U(0 众 differs 
F+, assumed to be orthogonal to F 一， it follows that P+l 
F-. 

Next we show that Z(t) forma semigroup. According t< 
U(r) maps F+ into F+. Since P + removes the F+ compo 
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Thus we can write for/: in AT that . 

mZ(s)k = P+U(r)P + U (啪 — 

To show that Z(0 tends strongly to zero, we use property (28c) that the union of 
U(f)F+ is dense in H. Therefore given any k in K, and any e > 0, there is a vector 
/+ « ^+. and r such that U(/-)/+ differs by less thane from k. Since P + U(r) is a 
contraction, .—.' - . ■ : - 咕 .‘ .•“ 一心 _^ ： —^-一 

||P + U ⑴- U(r)/+)|| <e. 

But by (28c) for t +?* > 0, P+U(r+r)/ + is zero; therefore for / > — ||P+U(/) 々 || = 

||Z(iT)/r|| < s. • - - - ::: - ....-- - 口 

In the next section we will present an example that lends substance and interest to 
the abstract theory described in this chapter. An equally interesting example will be 
given in chapter 37, section 37.9. 


36.5 THE WAVE EQUATION IN THE EXTERIOR OF AN OBSTACLE 


Let 5 be a smoothly bounded domain, the obstacle, in E 3 , contained in the ball 
of radius R, R some positive number. At the end of section 35.4 of the preceding 
chapter we have looked at solutions of the wave equation 


Mu — An = 0, 

defined for all x in the exterior of B and for all times, and which are zero on the 
boundary of B. We have shown there that such solutions conserve energy, that is. 


E + ul.)dx 


(31) 


is independent of time, where the integration is over all points x in the exterior of B. 

The square root of energy is the energy norm; we denote by // the completion in the 

energy norm of all initial data {w (0), W/(0)} that are smooth, of compact support in 
the exterior of B, and are zero on the boundary of B. We denote the energy norm of 
the initial data/ = {/】，_/ 2 } as ||/|| £ . 匕 

As already remarked in section 35.4, using techniques in the theory of partial dif¬ 
ferentia] equations one can show that there exist for all times, positive and negative, 
solutions of the wave equation in the exterior of B with prescribed smooth initial 
data with finite energy that are zero on the boundary of B. Conservation of energy 
shows that these solutions are uniquely determined by their initial data. So we may 
speak of the operator U(/) that maps initial data into data at time t: 


U(r ； 










r lim inf||U(O/ll£,G = 0. (320 


For a derivation of property (27c) from (32)，and a proof of (32) we refer to chapter V 
of Lax and Phillips. We remark that the key step in the. derivation of (32) is to show 
that the generator of the group U(/) has no point spectrum. 
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The closure in the energy norm of the initial data of outgoing solutions forms the 
outgoing subspace Fj^.. As before, properties (28a) and (28b) are easily veri fied ， 
while property (28c) is hard work. 


Lemma 8. F 一 and F+ are orthogonal to each other. 

Proof. We mimic the constmctignjUncqgiing data^as described at the^end of_ 
section 35.5, chapter 35: choose any number T greater than R. It follows from Huy- 
gSns’s principle that a solution u(x ， t) in free space R 3 x R whose data at time T are 
zero for |x| > T — 7? is incoming for r < 0. 

It follows from the conservation of energy that if w and v are two solutions of the 
wave equation in free space for 0 <t < T , then the ener-gy scalar product 

(_，“,(/)}， ⑽)，⑺})五 

of two solutions of the wave equation is independent of t. In particular 

({« ⑼， _)}， [v(o), v t (o)}) E = ({w(r )， ii t (T)}, {u(D, v t (T)}) E . (33) 

For h constructed as above, and v any outgoing solution, the right side of (33) is 
zero, since the functions u(T) ， u./(T) are supported inside the ball of radius T 一 R, 
whereas an outgoing solution v is zero there. 

The argument above makes plausible but doesn’t quite prove lemma 8, since we 
haven’t shown that the incoming solutions u constructed above with the aid of Huy- 
gens’s principle are dense among all incoming solutions. I assure the reader that a 
straightforward proof of lemma 8 can be found in Lax and Phillips. □ 

The use of Huygens’s principle to show the orthogonality of incoming and out- 
going data is quite natural, for in two space dimensions; where Huygens’s principle 
fails，incoming and outgoing data are not orthogonal 

The semigroup Z(r)~is a natural tool for studying the interaction of waves with 
the obstacle. Incoming waves have not yet interacted with the obstacle，and outgoing 
waves never will. Their removal by the projections P_ and P+ clears the way for the 
study of waves that interact all the time with the obstacle. 

There is an intimate relation between the shape of the obstacle and the spectrum 
of the operators Z(0 and their infinitesmal generator G. We start with a preliminary 
result: 

Theorem 9. Let k be any positive number. The operator (kl — G)~~ ] Z(2R) is com- 
pact. 

For proof we refer to chapter V of the Lax and Phillips book. A fairly immediate 
consequence of theorem 9 is 


Corollary 9’. G has a pure point spectrum accumulating only at oo. 
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Exercise 3. Deduce corollary 9’ from theorem 9. (Hint: Use the spectral theory of 
semigroups described in chapter 34, section 34.5.) 

The link between the geometry of the obstacle and the spectrum of Z(0 is the 

geometrical optics description of wave propogation. The relevant geometric property 

is how long the obstacle can retain a ray, defined as a path consisting of straight line 
segment, reflected at the boundary of the obstacle according to the classical laws 
of reflection. Denote by i{B) the supremum of the length of reflected rays that are 
contained within the ball of radius R. 

Theorem 10. 

(i) If 1(B) < oo, then Z(t) is compact fort large enough. 

(ii) Ifl{B) = oo, ||Z(OH = \ for all t. 

The proof of this theorem is too technical to be included here; a few remarks 
will have to suffice. Lax and Phillips have pointed out that part (i) would follow 
from a generalized Huygens principle, which states, roughly, that in the exterior of 
an obstacle the sharp part of signals propagates along rays, including rays reflected 
from the obstacle. This generalized Huygens principle was proved be Melrose and 
Taylor. 

Combining (i) with local energy decay, we conclude that all eigenvalues of Z(t) 
are of the form e yot , 0 > Re yy > ... — —oo. It follows that ||Z(Oil decays expo¬ 
nentially as r oo. Lax, Morawetz, and Phillips have shown that for star-shaped 
obstacles (for which i{B) < 2R), ||Z(Oil £： < e~ at , a > 0. Their proof is based on 
the nonstandard energy estimates of Morawetz, and does not rely on the generalized 
Huygens principle. 

Part (ii) is based on the notion that the wave equation has solutions whose energy 
is contained in an arbitrarily preassigned neighborhood of any given reflected ray. 
Such solutions were constructed by Jim Ralston. 

When t(B) = oo, the real parts of the eigenvalues of the generator G do not tend 
to —oo. Interesting information about their location was obtained by Ikawa. 
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SCATTERING THEORY 


Two souls dwell in the bosom of scattering theory. One is mathematical, and handles 
the unitary equivalence of operators with continuous spectra. The other is in physics, 
and deals with such notions as quasi-stationary states, cross sections, and what is 
observable in quantum mechanics. 


37.1 PERTURBATION THEORY 

Perturbation theory was developed by Lord Rayleigh, and by Erwin Schrodinger 
as a means of solving problems in physics, classical and quantum mechanical. A 
rigorous theory was set in place by Franz Rellich. In this section we describe the 
simplest results; an extensive discussion can be found in Kato’s magisterial book. 

Theorem 1. Let Abe a self-adjoint operator in a Hilbert space H, and a an iso- 
lated eigenvalue of A of multiplicity one. Let D be a bounded symmetric operator, 
||D|| 5 1. Then for € small enough A + eD has an isolated eigenvalue of multiplicity 
one in [a — a 4 - €]. 

Proof. We need 

Lemma 2. Let'? and Q be orthogonal projections in a Hilbert space whose differ¬ 
ence has norm less than 1; 

_ .….… „— __IIP- QII <X, _ _ __........__—. ⑴― 

Then the range of P andQ have the same dimension. 

Proof. If the dimension of the range of P were greater than that of Q, then there 
would be a nonzero vector u in the range of P that is orthogonal to the range of Q. 
For such u, Vu = u, Qu = 0, so ||(P - Q) «|| = \\u\\, contrary to (1). □ 
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Let C be a circle centered at the isolated eigenvalue a of A, with radius so small 
that C 1 lies in.theje&olv.ent^etx)f. A J!.,etthe. o jily^point.of-Xhe spectrum ofA contained 
inside the circle C be a. Define the mapping P by 



⑵ 


Clearly,.P is the projection E(a)where. E is the. orthogonal projection 二 vaiuecLmea-— 
sure entering the spectral resolution of A. By assumption, the range of P is one 
dimensional. 


For e small enough, f - (A+^D) is invertible for all f on C, and its inverse differs 
little from (f = A):! • So we-may-xtefine-E € ~by .. .. …… 

P e = f ((-A-eDr 1 奸； a') 

is also an orthogonal projection, and ||P-P e || tends to zero as e tends to zero. It 
follows therefore from lemma 2 that for e small the range of P € is one dimensional; 
every vector in the range is an eigenvector of A + eD. The corresponding eigenvalue 
tends to a as ^ tends to zero. □ 


Exercise 1. Prove that this eigenvalue of A -f ： differs from a by < €. 


Corollary. It follows from formula (2 ’）that the eigenvector and eigenvalue o/* A 4 - 
depends analytically on e. 


Exercise 2. Denote the eigenvalue of A+eD by a(€), the corresponding eigenvector 
of unit norm by u(e). Show that 


d 


«). 


How about higher derivatives? 


Remark. Suppose that a is an isolated eigenvalue of A of multiplicity n. The ar¬ 
gument used to prove theorem 1 shows that in this case A + eD has n eigenvalues in 
[a — 6 , a + €]，some of them possibly multiple. 

From formula (2’）we can conclude that the eigenvalues of A + eD are algebraic 
functions of e，with a possible algebraic singularity at € = 0. Rellich has pointed out 
that an algebraic function can be expanded in a Puiseux series, a power series in e 丨"，， 
P the order of the branch point at ^ = 0; all branches of this power series represent 
solutions of the algebraic equation; that is, they are eigenvalues of A + ^D. But for 
P 7 ^ 1 some of these branches are complex valued for € real; since all the eigenvalues 
of the self-adjoint operator A+are real, p must be 1. So all eigenvalues of A+eD 
are analytic functions of e. n 











We turn now to the other extreme: 


Theorem 3. Let A be a self-adjoint operator in a Hilbert space H, a an essential 
point in its spectrum, that is, for every interval I containing a, E(/) has infinite- 
dimensional range. Let C be any symmetric compact operator; then a belongs to the 
essential spectrum of A+ C. 

Proof. If not, for some interval I containing a, the range ofEc(/) would be finite 
dimensional, where Ec is the spectral resolution of A + C. Adding another compact 
operator to A + C would eliminate the finite number of eigenvalues in I altogether. 
So it suffices to prove that a belongs to the spectrum of A + C, 

To see this, we show that the norm of the resolvent of A + C at a + 巧 tends to cx) 
as ?7 approaches zero. Decompose the compact operator C as 

N 

C = F + S ， where F = ^ / ； )/y ； l|S|| < rj. (3) 

l 

For every .r in the range of E(of - -f rj), ||(A - a - irj)x\\ < 2 ? 7 ||x||. Since 
the range of E(or — ??， a + 77 ) is infinite dimensional, we can choose x so that it is 
orthogonal to all the/y, / = 1， For such an a:, Fx = 0. Using this, as well 
as (3), we get 

\\(A-bC-a-irj)x\\ == \\(A-a-iri)x+Sx\\ < ||(A—a- 鋪 x|| + ||S;c|| < 3??||x||. 

( 4 ) 

This proves that the norm of (A + C - a — £•??) 一 1 is greater than l/3rj. □ 

Theorem 3 is slightly misleading. Suppose that the spectrum of A is absolutely 
continuous; every point of it belongs to the essential spectrum of A, and therefore to 
the essential spectrum of A + C，C any symmetric compact operator. But, as Weyl 
observed, the spectrum of A+C need not be continuous; he showed that there exists, 
for any positive €, a compact operator C, ||C|| < €, such that the spectrum of A + C 
—_is_p.ure_poinLspectrum,..denseJa the -formerly continuous spectrum. This result — 
spectral curdling 一 was sharpened by von Neumann, who showed that C may be 
taken not only compact, but of Hilbert-Schmidt class (see chapter 30, section 30.8), 
with arbitrary small Hilbert-Schmidt norm. 

- For-a^ymmetric-GompaGt-operator-C T -with--eigenvalues-{-yy} 7 -the-Hil^ 

-—nora>is 4 £: 4 ^):【/- 2 、More generally one can define the 77 -cross norm as (5Z \Yj\ p ) 
see Shatten. Kuroda has further sharpened von Neumann’s result by showing that C 
may be taken to have arbitrarily small p-cross norm, p > 1. 

For p = 1， the p-cross norm is the trace norm; see chapter 30, section 30.2. For C 
of trace class the story changes dramatically: Marvin Rosenblum has shown, in his 

dissertation, that perturbing a self-adjoint operator by adding a symmetric operator 

of trace class leaves the continuous spectrum continuous. The next two sections are 
devoted to this result; we follow Kato’s formulation. 









37.2 THE WAVE OPERATORS 


~Tfie scene'Is^^separabfe - HTIBeft space //「arid a pair of self-adjoint operators, A 
-and-B7-A-s-we-obser-ved4n-ehapter-35—A -and-B-generate- one-parameter-unitary 
groups, which we denote as e il ^ and e //B . Define the one-parameter family of unitary 
operators W(r) by ... : ___ 

It is the strong limits of W(r) as t tends to 士 oo that occupy center stage: 
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>rm the difference quotients 

_ t isB _ j 

W + 一 —— v = f —— -W + v, 
s s 

lere i; is a vector in the domain of A. Take the limit ^ 0; on the left we get 

V+ Au. Therefore the limit on the right exists too, so W+ v lies in the domain of B 






Suppose that not only the wave operator W+(B, A) but also W+(A ， B) exist; then 
it follows from ⑻ that W^(B, A) = W(A ， B)，so by ⑺ W* (B, A) is an isometry. 
It follows that its nullspace is trivial, so K = H. We summarize: 

Theorem 4. If both wave operators W+(B, A) and W+(A ， B) exist, then A and B 
are unitarily equivalent. 

Operators that are unitarily equivalent have the same spectrum. In particular, they 
have the same point spectrum. SinjQ ❿ ig 旦敗 tingJli 即 iy_BlsiperturbatiQn _QtA ， it.. 
is highly unlikely that a perturbation would leave the eigenvalue of A unchanged; see 
exercise 2. It is clear that if A has a point eigenvalue, the wave operators Wi(A, B) 
would exist only under the most exceptional circumstances. However, this is easily 



… •，— _ 細篇 4._ 


Exerciser In chapter 31, section 31.4, we dealt only with bounded operators; show 
how to extend these notions to unbounded operators. 

Definition. The generalized wave operator,, also denoted as. W+ (B,..A).is defined as 
Jm^WWPc^Wi, (60 












Exercise 5. Show tHat'iflhe generalized 碎 SStor W;(B ， Ay exislinrSiifs— 

H ⑹ onto a subspace K of H that reduces the operator B, and that B has an abso¬ 
lutely continuous spectrum on K. 


3^ rc ^ se ^ Suppose that both generalized wave operators W+.CB,. A) and W+(A ， B) 
exist; then the absolutely continuous parts of A and B are unitarily equivalent. 


37.3 EXISTENCE OF TIDE WAVE OPERATORS 

The following result is due to Cook; see also Jauch and Kuroda: 






||W+ M-W(a)«|| z = 2||«r-2Re (W(a； 

Expressing W+m — W ⑷ 《 on the righ 
identity (9)，with j replaced by —t, gives 

Theorem 7. (Rosenblum) Suppose that t 
an operator D of trace class. Then the g 

W 丄 fA. m 




we aenote oy a me smallest ciosea suospace 01 n mat comams j una reauces 
the operator A. The subspace K can be obtained as the closure of all -vectors of 
the form b{A) /, where b are bounded, continuous functions on M; see chapter 31, 
section 31.5. Note that B = A on the orthogonal complement of K, so for u 丄尺， 
W(0 is the identity, as are the wave operators. It suffices then to prove the existence 
•of the wave operatorsi ; ori\rand"BTestrrcted _ tcr/f; 

Decompose / as •-- 一 、二 :-- — - --- 

f = 8 + h, T= a-F c )f. 

The closure of the set of vectors b(A) g is the subspace on which A is absolutely 
continuous; A is singular on its orthogonal complement. Since g belongs to ⑻， the 
measure (Eg y g) is absolutely continuous. So K^ c) can be represented by L 2 (S), S 
some Borel subset of M, and A acts as multiplication by X in this representation. 
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With Dw defined as c(w ， 7) /, we take for simplicity c = 1, and define 
D c u = P c DP C = (P c w , f ) V c f = (u, g) g; — 


g = P c f is represented by a square integrable function supported on S. Thus the 
existence of the wave operators W+(B, A), B a rank one perturbation of A, has been 
reduced to the case where the Hilbert space is L 2 (5), A is multiplication by X,-and - 
A.+ D, Du = {u r g) g. . . ... . 

We can regard L 2 (S) as a closed subspace of L 2 (M) = H. We extend A to H as 
multiplication by X, and Bu as (w, g) g, where g is defined as zero on the complement 
of 5*• It suffices to prove the existence of W+ on the extended space. 

W 色 prove the existence of the wave operators first in the case where 茗 is a smooth 
function, it and its derivatives rapidly decrease as A tends to ±oo. We appeal to 
theorem* 5; we choose the dense subset J to consist of all smooth functions u(x) 
that decrease rapidly together with their derivatives as 入 — ±oo. We verify now the 
three hypotheses of theorem 5: 

(i) The domain of A and B consists of functions uQ,) for which 入 m ( 入 ) is square 
integrable; hypothesis (i) is clearly satisfied. 

(ii) ' 

D c e- i,A u= (e- ilA u ， g) g 'J e - 卟 u Qm) 伽 ) dfMg(k) 

= ^g(-t)g(k), ( 12 ) 

where — denotes the Fourier transform. Since both w and g decay rapidly at infinity, 
ug is in L\ and so its Fourier transform js continuous, as required in (ii). Since 
the derivatives of u and g decay rapidly, ug decays faster than any power of t as 
1 ^ 士 o°; therefore requirement (iii) is satisfied. So we conclude that the generalized 
wave operators W 土 (A, B) exist. 一 

•To pass to arbitrary 名 ， we make use of the identity (11). Replacing De 一 //A m in 
this formula by its expression in formula (12), we get 

IIW+ « - W ( a) m || 2 = -2Im ^°° (e i,A Wlg,u'j dt. (13) 
Abbreviate W^. g as^*; then we can rewrite the second factor in the integral above as 

(e ilA g*, m ) = J e i,x g*{X)u{X)dX = 

We estimate the integral on the right in (13) by the Schwarz inequality: 


r +u — W(a) «|| 2 < 2 




ity the Fourier transform is an isometric mapping times 
I right side above is less than 


\S*\ 2 dX, 


2n J \g*u\ 2 dk<2n\u\l 
laximum of |w ( 人 )| on R. Furthermore 
f lg*l 2 ^ = ||^|| 2 = ||W^ g || 2 <|| 5 || 2 , 


i adjoint of the isometry W + , has norm equal to 1. Putting 
we get 


- W(fl) u\\ < (8^)'/ 4 \u\ X £- |i7|| 2 ^ 1/4 . 

holds for a replaced by b\ so by the triangle inequality 
«ll <( 87 r )'/ 4 | u |^ 2 


|[f 


(0! 2 ^ 


li/4 


{t)\ 2 dt\ 


proved under the assumption the u and 2 are smooth and ] 





= 八卞、 • ， g n )g n . gn g, [ne rignt side of 
ide of (130; we claim that so does the left side. 

: rate that e iBnt tends to e iBt ; this is easily done, 
s the solutions of the differential equations 

n — ^B /7 u n = 0, w(0) = « /z (0) = w, 

3. Subtracting the equations, we get 

fin) 一 Dik. ____ 

integrating, we get 

i f S e i{s - ,)B (B-B n )u n dt. 

Jo 


II， we deduce that 








As g /2 tends to ||B - || = || (•； g) g 一 (• ， 茗 / 1 )如 || tends to zero; this completes 

the proof that tends to e /B/ in the uniform topology; strong convergence would 
have been enough. This completes the proof that the inequality (13’）is valid .for all 
g in L\ and all smooth u that decrease rapidly at oo. 

The right side of (130 tends to zero as a and b tend to oo; therefore so does the 
left side. This proves the convergence of W(0 u as t tends to 00 , Since the smooth, 
rapidly decreasing functions u are dense in L 2 (R), and since the operators W(r) 
_^§y. e ... n 9 rm 1 ，卞 follows that the limit exists for all w in L 2 (R), and therefore the 
generalized wave operator W+ (B, A) exists. Since the role of A and B is symmetric, 
the wave operator W+(A, B) also exists; this shows that the absolutely continuous 
parts of A and B are unitarily equivalent. 

.-We pause-for a moment to-point out that even when the operator A has absolutely 

continuous spectmm，a rank one perturbation can create a point spectrum of B. Take 

A to be multiplication by x acting on L 2 (M) S and D = (., /) /, / in L 2 and smooth 
(Lipschitz continuity will do); we are looking for an eigenfunction w of A + D, with 
eigenvalue of r: 

xu(x) + (m, /) f(x) = r u(x). 

Solve for m(jc )： 

w(x) = y( x ) 

r 一 x 

In order for u to be square integrable, r has to be a zero of /. Taking the scalar 
product of both sides with / gives，after canceling the factor («，/)，the relation 

dx = 1, 

a second condition, in addition to f(r) = 0. If both conditions are satisfied, A + D 
has r as an eigenvalue. 

Exercise 7. Show that / can be so chosen that A+D has 72 eigenvalues, n any natural 
number. Can B have infinitely many eigenvalues? 

The eigenvalue r can disappear at the slightest change in the function /; for al- 
tliough the changed / will have a zero near r, the second integral condition will 
in general not be satisfied. This phenomenon is called the instability of the point 
spectrum embedded in the continuous spectrum. 

We return now to complete the proof of theorem 7. 

Let D be a symmetric operator of trace class. Denote its eigenvalues by dk, the 
corresponding normalized eigenvector by / 々 • By (20) of chapter 30 ， 
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and D itself can be expressed as 

d = ,/*)A- 

Denote by D, z the finite sum 

D„ = X]4( ， fk) fk. 

1 

Define B n as A + D n . By what we have already proved, the generalized wave oper¬ 
ators W+(B n , B , 卜 i) exist. Therefore (see exercise 3) the wave operator W + (B n , A) 
exists; we denote it as W ；I +. We use now formula (11): 

nOO . \ 

||W n+ «-W„(fl)«|| 2 = -2Im j (e itA - W* + D„ e-' ltK u,u) dt. 

Using the definition of D n , we can write this as 

||W n+ M- W „ ⑷ M || 2 = -21m J^°T,d k (e- i,A u,f k ) (e ilA W* +{ f k , u)dt. 
Using the Schwarz inequality, first with respect to the sum, then the integral, we get 
l|W /I+M - W„(a)H|| <2^141^1 dt 

i/7 

•[ 零 I4lf>”，“)| 2 ^] ' 

where is an abbreviation for fk. As we saw earlier， ||/^|| < ||/IU = 1- 
The vector it for which we want to prove the existence of the strong limit W(f) u 
as f -> oo belong to the absolutely cpntinuous subspace for the operator A, so 

- w-= Replace-w-by P c w-on the right in (14); using the fact that P c commutes 

with e itk , we can rewrite the right side of (14) with fk and replaced by P c fk 
and P c /^. Rather than rewriting (14)，we just assume that fk and belong to the 
absolutely continuous subspace of A. 

- Next-we-use-the-speetral-resolution- of-A-to-rewrite-the^terms- on the right in (14) 

- —-^^as=followsr 〜 - — 

(.e~ ilX u,f) = J e~ i,k d(Eu,f) 

and 

(e 1 •’ A /*, u) = J e itX d(Ef*,u). 
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Since u, f, and belong to the absolutely continuous subspace of A, we can rewrite 
the integrals on the right in terms of the Radon-Nikodym derivative as 


etc. 


By the Schwarz inequality, 


(E«,?)<[|(E W ； m) |(E/ 1 /)' 


1/2 


Exercise 8. Prove this inequality. 

Suppose that the vector u has the property that 


(15) 


sup — (Ew, u) < oo. 


Then, by the Schwarz inequality. 


^E( H! /)< m [A (E/i/) 


11/2 


(16) 


where m is the square foot of the sup (]6). This shows that d(Eu, f)fdX is square 
integrable, and so by Parseval’s theorem, (15) is square integrable as function of t. 

Denote the function (15) as and the corresponding integral with / replaced 
by f* as 77 *( 0 . Setting these in (14), we get 


(140 


||W„ + m -W„(a)«j| 2 1^1 J 141 J \nt\ 2 d^ 

By ParsevaFs formula, 

r ' 從心 / 二响 2 4 i 去 ㈣ '4“ 

<2jtm 2 J ^ (E/*, f*) dk = 2n m 2 \\f*\\ 2 <2n m 2 . 

So we can rewrite (14’）as 

||W, !+ « - W„(fl)M|| 2 < (8tt) 1/2 \ d k\ l%l 2 *j mllDllt 1 / 2 ; (14") 


here we have used the fact that J2\ Wk I < l|D|| tr . Replacing aby b and using the 
triangle inequality, we get from (14〃）that 
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\\W n (b)u-W n (a)u\\ < 


(S^HDIItr)'/ 4 m \rik\ 2 dt 


1 1/4 、 


Now we pass to the limit as n tends to oo. Since B /z = A + D n tends uniformly to 
B = A + D ， W /Z (a), W,i {b) tend uniformly to W ⑷， W ⑹ ，The right side tends to 
the infinite sum. In the resulting inequality we let a and b tend to oo. We claim that 
the right side tends to zero. This follows from these facts: 


(i) r \nk(t)\ 2 dt<2nm 2 . 
J—oo 

(ii) |4I < oo. 

poo 

(iii) lim / \r)k\ 2 = 0. 

a—aJa 


This proves the existence of the strong limit W (b) u as b tends to oo. 

This conclusion is valid for those vectors u that satisfy (16). The last step of the 
proof is to verify that the set of such u is dense in H ⑹. This is not hard; let v be 
any vector in H( c ). For such a v, the measure (Eu, v) is absolutely continuous with 
respect to Lebesgue measure, so we can write 

Hull 2 - f d(Ev ， v) = j 去 (Eu, v) dk. 

d(Eu, v)/d 入 is nonnegative and in L 1 . Denote by S m the set of k where 
d(Ev, v)/clk > m. Set v m = (I 一 E(5 m )) denote by the measure 

v m ). 


If we denote the measure (Ev, v) by /z, the two measures are related by dfx m == 
(1 一 c s )dfM, where c s is the characteristic function of the set S m . It follows that the 
Radon-Nikodym derivatives are similarly related: 

—(Ev m , v m ) = (1 — cs) — (Ew, «)♦ 

Since S, n was chosen as the set where dji/dX > m, it follows that dix m /d\ > 爪 £ 
a lUL—T he vectors Vm tend to i> as m tends to oo. This completes the proof 
that the vectors u that satisfy (16) are dense in H( C K Since the domain of the wave 
operator is a closed subspace of H^ c \ this completes the proof that the generalized 
wave operator W+(B ， A) exists when A and B differ by .an operator of trace class. 

□ 


.Of course, the wave operator W- CB, A) also exists, and since the A and B enter 
the hypothesis symmetrically ， W + (A, B) also exist. Therefore the absolutely contin¬ 
uous parts of A and B are unitarily equivalent. 


4 
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37.4 THE INVARIANCE OF WAVE OPERATORS 

The scope of the main result of section 37.3 is greatly extended by the following 
result:Xet *^ (A) be a real valued function with the following properties: 

—:Tirfisp&6 研茲免 f^renf 诏 ble.. 

…⑹ is' positive，continuous and of bounded variation locally. 

Theorem 8 (Birman-Kato). Let A and B be a pair of self-adjoint operators on a 
Hilbert space that differ by an operator of trace class. Let be a function.as above. 
Then the wave op£mtcuLW^0p^.(JB). i <j).(A))..exisLajid.are independent of(j). „ . 

Since W 士 (0(A) ， 0(B)) also exist, the absolute continuous parts of (/) (A) and 
沴 (B) are unitarily equivalent. 

Even when (p is not monotonic, the existence of W 土 (0(B) ， 沴 (A)) can be shown. 
Only in this case these wave operators are no longer equal to W±(B, A) but are 
composites of them. 

For a proof of these result, see chapter X of Kato’s book. 


37.5 POTENTIAL SCATTERING 

As an example of the abstract theory developed in section 37.3, take the Hilbert space 
// = L 2 (R 3 ), A = —A, the negative Laplace operator, and B = —A +g,g a real¬ 
valued function. We take the simplest case when the potential 夺 is a square integrable 
bounded function. 

Theorem 9, For q bounded and square integrable，the wave operators W 土 (B ， A) 
exist, where A = - A, B = - A + 

Proof, Fourier transformation is the spectral representation for —A, and shows 
that its spectrum is absolutely continuous, fills M, and is of infinite multiplicity. We 
appeal now to theorem 5 in section 37.3. We take for the dense subset J the linear 
span of functions of the form 泛一以一 a) 2 /2, a in R 3 . For w of this form, we can solve 
the equation u t = —/ Aw = / Aw explicitly by Fourier transform: 

U t =-i^U, H(0) = e -? 2 /2 + ) 

so u(^, t) = e -(2»'»+i)|-/2+ia^ Taking the Fourier inverse gives 

u(x, t) = e~ i,K u{x) = (1 + e -b-a) 2 /(2+4"). 

Clearly, this function belong to the common domain of A and B, as required in (i). 
Conditions (ii) and (iii) are also fulfilled, since 
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||(B-A) e - /,A «|| = ||?«(-r,0ll<ll«7ll |1 +2"「 3 / 2 

is a continuous function of t, integrable over the whole t axis. 

It remains to show that the linear span of the functions j s dense in 

L 2 (M 3 ). According to the spanning criterion(see theorem 7 of chapter 6 ) we have to 
show that a function / in L 2 (K 3 ) that is orthogonal to all functions j s 

zero. To see this, we rewrite this condition as 

0 = / f(y + a) e~ y2/1 dy - J e ia ^ 

here in the first step we changed the variable of integration to x — a = y y and in the 
second step we use Parseval’s relation. The last equation says that the Fourier inverse 
of fe - 与 2 , 2 is identically zero. But then so is and so is /(^) and / itself. 

^ □ 

We conclude from theorem 9 that —A is unitarily equivalent toB = — acting 
on an invariant subspace of B. In particular, the continuous spectrum of —A + q 
contains the whole positive axis, of infinite multiplicity. 

The restriction that q be square integrable can be relaxed. For sharper result, see 
chapter X of Kato’s book, and the literature quoted there. However, if q is merely 
bounded, then with probability 1， △ + 分 has a pure point spectrum. This result, of 
importance in solid state physics, is called Anderson localization; for example, see 
Frohlich and Spencer. 

On the other hand, if the potential q(x) tends to zero fast enough as \x\ co, 
one can show that (入一 A) 一 1 and (入 一 B) 一 1 differ by a trace class operator when 
X < 0. Then it follows from theorem 8 that W±(A, B) and W±(B, A) exist, and so 
一 △ and the absolutely continuous part of —A + <7 are unitarily equivalent. 


37.6 THE SCATTERING OPERATOR 

Suppose that B is a perturbation of A, and that the generalized wave operators 
Wi(B, A) exist and map the absolutely continuous-part-of-A-onto-the-absolutely 
continuous part of B. Then, by (90, 

W+A = BW+ and W_A = BW_. 


-From these-retodons-we-deduc&-.that ^ -- ^. - . -= v - :--- —— 

W： 1 W + A = W ： ! BW4. = AW~ l W + . 

In words, WI 1 W+ commutes with A. This operator, WI 1 W+，is called the scat¬ 
tering operator, ^.nd is denoted as S. 

The physical significance of the scattering operator is this: 


1 














I as an unperturbed and perturbed S chrodinger operators, where 
the perturbation is negligible. The operators discussed in sec- 
a pair, for the potential q(x) tends to zero as x tends to oo."For"—— 
most of the signal has propagated to large distancesr-so-that-the —— 
very little from a signal governed by the unperturbed equation, 
larly, for t large, negative e lt ^ u differs very little from a signal 
nd to ±oo, we deduce that 


u + = t lm o e-^e^u = WZ 1 u 




>perator linking and u + , WZ 1 W+, is the scattering operator. Thus the 
ring operator links the state of the perturbed system in the remote past to its 
In the dim future. 

is time-dependent picture of the scattering process has been described in 1945 
0ller. A stationary picture has been formulated by John Wheeler in 1937. It 
laborated by Heisenberg in 1943; Heisenberg’s motivation was that a physical 
/ should only deal with observable quantities. The forces acting on electrons 
Hiding a nucleus cannot be measured; a physical experiment measures only 




about this fascinating problem, another gift bestowed on mathematics by 
we refer to the review article of Ludvig Faddeev, and to volume 2 of Reed 

)n.. . . ..‘ 


S commutes with A, the natural description of S is in a spectral representa- 
where S acts a multiplication operator. We will elaborate this in a slightly 
setting in the next section. 


: CAL NOTE. In 1930 Heisenberg lectured at the University of Chicago on 
the new quantum mechanics. His assistant there was the young American physicist 
Frank Hoyt, who helped prepare the English lecture notes. During the Second World 
War Hoyt joined the Manhattan project for building nuclear weapons; one of his 
assignments was to scrutinize every wartime publication of Heisenberg and see if it 
could be a by-product of bomb research. Hoyt studied very thoroughly the two papers 
Heisenberg published in 1943 on scattering theory, and as he told me later, concluded 
that they had no bearing on nuclear weapons. This may have saved Heisenberg's 
life, for the OSS, the wartime precurser of the CIA, had been training an agent to 
assassinate him. 
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LAX-PHILLIPS SCATTERING THEORY 

The setting is the same as in section 4 of chapter 36: a unitary group U(0 acting on 
a separable Hilbert space H, and a pair of incoming and outgoing subspaces 尸一 and 
orthogonal to each other, each satisfying properties (27a) - (27c)，respectively 
(28a)-(28c) ^ 


U(t) map F- into F_ for t < 0, 

(17a) 

The intersection of U it) F- is {0}. 

(17b) 

The union of U(t) F- is dense in H, 

(17c) 


and similarly for F+，changing the sign of t in (i). 

We appeal now to the translation representation theorem 7 of chapter 35, which 
says that if the above three conditions are satisfied, then H can be represented iso- 





d/:+ represent u isometrically, ||灸 一 || = ||w|| = || 女十 ||， so S 
nee it maps L 2 (N, R) onto Lr{N, K), it is unitary, 
nd k + (x -t) both represent U(t) u，S maps the translate of 
translate of k^. 











(iii) Any *:in L 2 (7V ， M—) is the incdmirig representation of a vector m in 
Such a vector u is orthogonal to F+; therefore its outgoing representation 
is orthogonal to the outgoing representation of whidfis L 2 . 

This shows that belongs to L 2 (Ni M-)r — - 

Property (iii) is called causality, and can be put in the following words: The value 










w uc a vcului in n, k— ana us incoming ana outgoing uansiaiion repre¬ 
sentations. The Fourier transforms of and /:+， denoted as / 一 and /+， are the 
incoming and outgoing spectral representations of u. We denote their relation as 

Sf- = /+. (19) 

Theorem lO^. 

(i f ) S is unitary; 

(ii’）S commutes with multiplication by bounded, measurable junctions; 

S maps the Fourier transform of Lr{N, K_) into itself. 

Proof. (i f ) follows from part (i) of theorem 10, since S is unitary and so is the 
Fourier transform. 

(ii’）The Fourier transform transmutes translation by an amount a into multiplica¬ 
tion by e iaK . Therefore, by part (ii) of theorem 10, S commutes with multiplication 
by e iaX . Given any bounded, measurable function b(k), we can approximate it by a 
sequence b n of the form bn(X) = so that lim b n (X) = b(X) a.e., and 

so that the functions b n are uniformly bounded on M. It follows that for any / in 
L 2 (N, R), b n f tends to bf, and b n Sf tends to bSf, in the L 2 (N, M) norm. Since S 
is a bounded operator, Sb n f tends to Sbf. Since Sb n f — b n S /, (iiO follows. 

(iii’) is a restatement of part (iii) of theorem 10 in the spectral representation. □ 

Let k be an L 2 function supported on K-. Then its Fourier transform / can be 
extended as an analytic function to the lower half C 一 of the complex plane, f = 
入 + / % < 0, by the formula 

■m = -^= J° k(x)e^ x dx. (20) 

Theorem 11 (Paley-Wiener). The Fourier transform f of a function k in L 2 (N, M_) 
is a vector-valued function in l?(N 、 M), that has an analytic extension /(f) into 
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(i) For fixed r] <0, /(A. + irf) is a vector valued L 2 functon of "L As t] tends to 
—oo, ||/(. + /? 7 )|| tends to 0. 

(ii) As rj tends to 0, /(• + /??) tends to f in the L 2 -norm, 

Conversely, any function f with properties (i) and (ii) is the Fourier transform of 
an L 2 (N, function. 

A proof for scalar-valued functions, employing nothing more than the Cauchy 
integral theorem, is presented in chapter 38. An extension to the vector-valued case 
is, as so often, straightforward. □ 

We will denote the Fourier transform of L 2 {N ， R-) as H ， and the Fourier trans¬ 
form of L 2 (iV t R+) as H + ,H + can be characterized as consisting of vector-valued 
functions in L 2 {N,R) that have analytic extension into the upper half-plane C+, 
with properties analogous to those enumerated in theorem li. 

Theorem 12. The operator S defined in (19) can be realized as multiplication by an 
operator-valued function ( 入 ) ，mapping N into N. 

(i) M.(X) is unitaiyfor almost all X. 

(ii) M(X) is the boundary value of an operator valued function in holo- 

morphic C 一. 

(iii) For each f in C 一， M.iX) is a contraction, mapping N N. 

The function M(0 is called the scattering matrix. 

Proof. We want first to tackle (ii) and (iii). Let u be any vector in F 一. According 
to part (iii 7 ) of theorem 10’， its incoming and outgoing spectral representations /_ 
and /+ both belong to H^. So /L and /+ are vector-valued analytic functions in C— 

We now show that for any f in C 一 and any /一 in H—, the value of /+(f) is 
determined by the value of / 一 (（). To prove this, it suffices to show that if / 一 (（）= 0, 
then /+(f) = 0. We factor such an /_ as 


/-W = t—7^)- 

人十纟 

It follows from the Paley-Wiener theorem, theorem 1 1， that g belongs to H— Since 
-by-theorenr-l ： 0 V^commutes-with-iTmittpHeatton-by-functions-bounded-on-lR 7 - 



Since g belongs to H 一， by theorem 10’ so does Sg; setting 入 =< in the relation 
above shows that /+(f) = 0. 

/+(f) is related to /-(f) by a linear mapping of N N\ we denote it as 

桃） /-(?) =/+(()• (21) 
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To show that M(^ ) is strongly analytic, take / 一 (X) == ;z/(X — i)» n any vector 
in N. Clearly, /_ belongs to H-\ therefore so does /+• Set this pair in (21): 

7^ 桃 )/! = /+(?); 

. ? 

since /+(f) is analytic in C—, so is7W(f)«. 


(iii) Take any f in C-, any vector n in N, and define 


心 ) ={f H 

(22) 

For any positive r, 


一 …— 

k+(x-r) = e- iir e^ x n 

forjc < r. 

(220 

Set (22) into (22'): 



k + (x-r)-e- i?r k + (x) = 0 

forx < 0. 

(23) 

Define 



k-^S*k+. 


(24) 

Since S* commutes with translation, and since S* 
deduce from (23) that for all positive r. 

maps L 2 (N, R+) into itself, we 

k_(x - r) — e~ i(r k-ix)=0 

for a- < 0. 

(230 

This implies that 



= e^ mX m for 

x < 0 ， 

(22")' 


where m is some vector in N. We define as before the incoming and outgoing spectral 
representations as 

/_=FL，/+ = F 々 +, 

where F is the Fourier transformation. Using formulas (22) and (22’’）we get 
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We will now estimate the norm of m. Applying the Schwarz inequality to the left 
side of (260, we get __ 


^ii(m,p) W i<ii/-iiii— ii. 


(29) 


Using the definition of / 一 _as 尹 fc 一， . 

iijlii = \\ k ^\\ = ii5*fe + ii = |^|7 … • ••… ~m 


Using the definition (22) of k^. gives 


By calculus. 




p 

r+T 




(300 


J (30 〃) 


Setting (30), (30’)，and (30") into the right side of (29) gives 


\(m,p) N \ < |n|yy \p\n^ 


Since this holds for all p in N, it follows that |m|yv < |n|^. In light of (28 )， 
\M*(— ^)\n < 1, which implies that < 1, as asserted in (iii). 

p) is a bounded analytic function in C—. According to a basic result of 
the theory of analytic functions, 

lim (M(k + i rj) tu p)^ ~rj < 0,- (31) 

”一 0 


exists for a.a. real X. Take for « and p a denumerable dense set of vector in N; since 
l-A^(f)l < 1 5 it follows that the limit (31) exists for all n and p in N, for almost all 入 . 
Denote the weak limit (31) by M(X)\ clearly, |A^(X)| < 1 a.e. 

Let n be any vector in N. The function /- ( 入 ) = n/(X - i) belongs to HL \ there¬ 
fore it is the incoming spectral representation of some function u in H-. The outgo¬ 
ing spectral representation of u also belongs to Set f = 入 + / ” in (21): 


ITJ — 1 M^ + ir 1 )n = U(X + ir 1 ), ,<0, 


(210 


and let rj tend to zero. The right side tends to /+ in the L 2 (N y R) norm; therefore, so 
does the left side. It is not hard to deduce from this that (入 + irj)n tends strongly 
to M{k) n, for a.a. real k. 











Proof. Let u be an eigenvector of G: 

Gu = y w, Z(0 u = e yt u. (32) ' 

Let be the outgoing translation representation of u. Since u belongs to K, it is 
orthogonal to F+; therefore k + is zero on M+. In the outgoing representation Z(r) 

aets as-translation by r r foilowed by restriction-to-the-negative^xferso-f32)4?ecxn^ - 

kjr{x ~ o = e yt k(x), x < 0, / > 0. 

It follows that 



for.r < 0 
forO < x, 


n some vector in N. 
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The outgoing spectral representation of w is the Fourier transform of ky. 

/-j_(A.) = F = c c _ 

Since 一 1 (X) = M*(X) fpr 入 real, the incoming spectral representation is, by (21), 

f-W = 77^— M*(X)n. ■- - - --(33)— 

.•.... . - rr.K— . .. 

Since u belongs to AT, it is orthogonal to F_; therefore /L is orthogonal to H— Con¬ 
sequently / 一 belongs to H +，and thus has an analytic extension to C+. Formula (33) 
gives a meroraoiphic extension of /—to C+; it is analytic iff the potential pole at • — 
入 =—/)/ is cancelled by a zero of at f = —/y: 

M*(iy)n = 0. 

The reverse of this argument gives the converse proposition. □ 

The proof presented above gives a little more: the dimension of the nullspace of 
G — y I equals the dimension of the nullspace of M*(i y). More generally, one can 
show: 

Theorem 13 ; . A complex number y belongs to the resolvent set of G iffS{iy) is 
invertible. 

Proof. For a proof we refer to section 3, chapter El of Lax and Phillips 5 s Scatter¬ 
ing Theory. 

According to theorem 12, the scattering matrix *A4 ( 入 ） can be extended analyti¬ 
cally into the lower half-plane C_. Suppose that A4(k) is continuous in the norm 
topology along an interval I of the real axis. Then M can be continued analytically 
across 7 by the operator version of the SchwaiiISflectidirprindpS 

M(0 = M*(cr ] (34) 

for f in C+ near I. 


37.9 THE AUTOMORPHIC WAVE EQUATION 


Faddeev and Pavlov have given a beautiful application of the Lax-Phillips scattering 
theory to automorphic solutions of the wave equation in the hyperbolic plane. The 
Poincare model of the hyperbolic plane M is the upper half-plane (x, )，)，y > 0, 
equipped with the Riemannian metric 


ds 1 


dx 2 + dy 2 
.V 2 ' 


(35) 





THE AUTOMORPHIC WAVE EQUATION 


The isometries, called hyperbolic motions, can be expressed elegantly using the coi 
plex variable z = x + iy as 


az^rb 




cz + d 


, a, b y c, cl real, ad — be = L 


Exercise 9, Show that the metric (35) is invariant under the hyperbolic motions (’ 


The group G of hyperbolic motions has many interesting discrete subgroup 
which have the property that the images of any point under the mapping in r 
cumulate only at cx>. A function u(x, y) is called automorphic with respect tc 
subgroup T if u(y(x, y)) = u(x, y) for every motion y contained in T, 

A domain 尸 in H is called a fundamental domain for r if 


(i) every point in H can be mapped into a point of P by some y 

(ii) no two points of P are mapped into each other by any y in T 


A boundary point of P will be mapped into another boundary poin 
inT. 



Exercise 10. Show that the image of a fundamental domain by any y in T is anc 
fundamental domain. 


A fundamental domain is called 泛 fundamental polygon if its boundary consis 
a finite number of geodesics. The geodesics in the Poincare model are circles w: 
center is located on the line j = 0, and their limits, the lines x = const 

The discrete subgroups that are amenable to scattering theory have fundar 
tal polygons that are unbounded. In this section we will look at the simplest : 
subgroup, the modular group T consisting of all hyperbolic motions of form i 
where a.b.c.d are integers. Clearlv. these form a subsrouo. and it is not hai 










The motion z z + 1 carries the side x = of T onto the other side x = 
The motion z — — 1/z carries the third side yr -f y 2 = 1 onto itself, mapping the 
point (x, y) to (—x, y). Denote by p and p l pairs of boundary points linked to each 
other by a mapping in the modular group. A C 1 automorphic function u satisfies the 
boundary conditions 


M(p) = M(〆) ， Un(p) - -Uni^) (37) 

at corresponding boundary points of T, where u n denotes the outward normal deriva¬ 
tive. 

The Laplace-Beltrami operator in the Poincare model is 


△H = -y 2 (^l + 3v). (38) 

Exercise 15. Show that Ajj is invariant under the hyperbolic motions (36). 

Denote by (, )^ the L 2 scalar product over T with respect to the hyperbolic area 
element: 


(w ， v)t = 



(39) 


Let w be a C 2 automorphic function that is zero for )，near oo. Integration by parts 
yields 


(Ahm, u)t = Uy) dx dy; 


(40) 


the boundary term is zero because of the boundary conditions (37). 

Formula (40) shows that the operator Aj^，defined for all automorphic function is 
symmetric and nonnegative. Its Friedrichs extension, also denoted as Ajj, is a self- 
adjoint operator. What is its spectrum? It turns out that it is more natural to renor¬ 
malize Ajj as 


L = An - 

Theorem 14. 


(41) 


(i) On the intejyal 0] the spectrum of L consists of the single point — 去 . 
(7/J L has infinitely many positive eigenvalues, accumulating at oo. The odd 
eigenfunctions span the space of odd functions in T. 

(Hi) L has absolutely continuous spectrum of multiplicity 1 on ]K+. 


Proof. We will not give a complete proof of part (i), for it would take us too far 
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Denote the integrand on the right in (42) by q, split the integral into two parts, and 
use inequality (45): 



> JJ^ - dxdy + JJ^ qdxdy 


Define the quadratic functional K as 


K(U) = JJt ^rdxdy. 


and add it to both sides: 


(Lu,u) + K(u)>C(uh 


(46) 


C(«) 


b l y^\ l ^) dxd}K 


It follows from Rellich’s compactness theorem that for any positive e there is a sub¬ 
space of u of finite “） S ^C{u). Taking, modestly, e = 1, 

we conclude from (46) that (L«, u) > 0 on such a subspace. It follows from this 
and (41) that the spectral resolution of L on [一 去， 0] has finite-dimensional range; 
therefore the spectrum of L over [一 0] consists of a finite number of eigenvalues. 

Since T has finite area, u = l is square integrable, and an eigenfunction of L with 
eigenvalue 一去 . There are in fact no others in [-| ， 0]，but we skip the proof. 

(ii) Both the operator L and the fundamental domain T are invariant under reflec¬ 
tion across the y axis: x —a*. It follows that the domain of L can be reduced as the 
direct sum of even and odd automorphic functions. For odd functions the first condi¬ 
tion in (37) becomes « = 0 on the boundary; the second condition is automatically 
satisfied. 

We will show that under the Dirichlet boundary condition w = 0, the resolvent 
(L + I)— 1 is a compact operator. To see this, denote (L + l)^ l w = w; this means 
that 


luLi -\rU = w. 






T are uniformly bounded. We use Rellich’s compactne: 
in chapter 22)，applied to the compact portion Ty of T 
uniform smallness of the hyperbolic L 2 -norm of u over th 
the precompactness of the set of u in the norm ||«|| 7 .Itfo 
theory of compact symmetric operators (see chapter 28) t\ 
set of eigenfunctions over the space of odd functions in 
are real, positive, and tend to zero. The corresponding ei< 
(iii) It is in this part of theorem 14 that the renormalizat 
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rhe change of variables u = y^ 2 v, y = e s turns this equation into the classical 
kvave equation 

~ ^ss = 0 . 

rhis can be factored as 

(9 / + B s )(v t - = 0; 

it follows that v t 一 w is a function ofs — t. We define 

-/2k + (s) = v s -v t = d s e- s/ %(e s ) - e-^ 2 h 2 (e s ) (51) 

to be the outgoing translation representation of the initial data {hi, hi) = h. Clearly, 
the outgoing representation of U(f)A is k^.(s — 0- 

We next show that every function in L 2 (E) appears as the representer of some 
element in H, and that the representation is an isometry. Take any smooth function m 
Df comnact suoDort in 1 < v. and define W4 - on T as 



Clearly, Et(w) = ||i 
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The initial data /z+ of do not, in general,： belong to H\ to make them orthog¬ 
onal to the eigenfunction 1， we apply the projection Q: 


- Q-A 卞 — er 7^ — d)j- 

c andrf being constants determined by the equations 


ih x -^ dy ~ 




2-d)'^l' = T 


Using the formulas (52 ; ) for h\ and /z 2 j we can rewrite these equations as 


y 1 /2 mQogy)^~=cA', 




where A is the area of T . Switching to ^ = log y as variable of integration yields 

f e~ s/2 m(s)ds = cA, - f e~ s/2 m , (s)ds = dA. (53) 

^0 Jo 

Integration by parts shows that d = -c/2. 

Lemma 15. 


(i) QA-f has the same energy as h+. 

(ii) Q/i- 4 - has the same translation representation as h— 

Proof, (i) By definition (50)， 

Erm+) =JJ T (^ ，l2 ~ % d) +h^y- { hz^ dxdy 

= E T (h + ) - 2d JJ^ ! ^dxdy + drA + ^JJ y dx dy - ^A. 
By formula (53)，this can be rewritten as 


2 

E T (h+)-d 2 A + C —A, 

4 

which is equal to E 丁 (h+), since d = -c/2. 

(ii) By definition (51)，the difference between the outgoing translation representa¬ 
tion of h + and of Q/z+ is 

W ( 心 - " 2 c - e^ s/2 d) = (昼 + d) = 0. □ 







not hard to show that L has absolutely continuous spectrum on K, which covers all 
ofM+. □ 

In chapter 36, section 36.4, we introduced the concept of an outgoing subspace 
F+ for a group of unitary operators V(t), as follows: 


(i) U ⑴ F+cF+forf >0. 

⑻ nU(0F + = {0}. 

(iii) UU(0F + = H. 



directly, without appealing to the translation representation theorem in chapter 35, 
section 35.5. 


„.„The- theory- describeiabave_-foiUhe-moddai^mup-can-be_cacried^eii>with-aii.Ly - 

trivial changes to any discrete subgroup, except thatjn general ， pne ， camio.t(fetennine，，_:—— ; _. 
precisely the location of the point spectrum. According to a conjecture/theorem of 
Phillips, Saraak and Wolpert, there is in general, no point specfriim embedded in the 
continuous spectrum. 

If the fundamental polygon has n vertices at infinity, the continuous spectrum ofL 
has multiplicity n on R+. Even if a whole side of the fundamental polygon lies at .. 
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There is an entirely analogous construction of an incoming representation; the 

—— incoming subspace F- is formed by Q/z 一， where denotes the initial data on T 

_of incoming solutions of form = y^l 2 n([ogy + /), f < 0, where n is 

' •— ..•" 

Exercise 18. Show that for t <- 0,- the solution of the-hyperbolic wave equation with 

imM4atai^is_w = .(a. ； 


Exercise 19. Show that F-j. and F 一 are orthogonal in the energy norm. 

Clearly, = y^ 2 n(log y + 0^ < 0, describes a wave arriving in T from 
infinity, through the channel - 士 <x < 士， just-as^-is-for 十 •>•>(} *a wave-traveHng 
to infinity through the same channel. It follows from property (iii) for F_ and F+ 
that everything that flows in from infinity eventually flows out to infinity. How fast 
is an interesting question, as we will show. 

It was explained in section 37.8 that a pair of orthogonal incoming and outgo¬ 
ing translation representations are linked to each other by a scattering operator. The 
corresponding spectral representations are related via multiplication by a scatter¬ 
ing matrix M(X). In the present situation, where the multiplicity of the continuous 
spectrum is one, the scattering matrix is a scalar function. |*M ( 入 )| = 1 for 又 real, 
and M has an analytic continuation into the lower halfplane X + ir],r} <0, where 
|«M (久 + irj)\ < 1. In chapter 38 we will meet these functions again in Beurling's 
theory of the arithmetic of bounded analytic functions. 

Faddeev and Pavlov have determined the scalar scattering matrix that arises from 

the hyperbolic wave equation for solutions that are automorphic with respect to the 

modular group. It is, aside of inessential factors. 


M(k)= 


?(2/A) 

W^2iX) 


戶 ( 入 ), 


where ^(A) is a product of gamma functions, and f is the Riemann zeta function. 
If the Riemann hypothesis is true, M has zeros in the lower half-plane on the line 
+ A; its meromorphic continuation into the upper half-plane by Schwarz reflec¬ 
tion has poles on the line |/ + 入 • We saw in section 37.8, theorems 13 and 13’， that 
if 入 + is a zero of M, then rj + iX is an eigenvalue of G，the generator of the 
semigroup Z ⑴ associated with the group of unitary operators U(r) and the pair of 
incoming and outgoing subspaces F_ and F+. Faddeev and Pavlov point out that if 
one could show that G has no eigenvalues y whose real part exceeds — |，the Rie- 
mann hypothesis would be proved. According to Phillips’s spectral mapping theorem 
for semigroups, theorem 12 in chapter 34, if y belongs to the spectrum of G, e y ， be¬ 
longs to the spectrum of Z(t). Since the spectral radius does not exceed the norm, 
\^ yt \ < \[L(J)\\e- Taking logarithm and the limit r —»• oo, we deduce that 


Rey < lim -log||Z ⑴ || £ . 
















Therefore to prove the Riemann hypothesis, it is sufficient to show that 

lim ilog||Z(f)||£ < (54) 

/~>oo t 4 

Faddeev and Pavlov point out that (54) is necessary as well. It is not hard to show 
that (54) would follow if 


lim jlog||Z(0/z|| £ 4 

could be proved for a set of data h dense in the domain of the semigroup. 

Could this formulation lead to a proof of the Riemann hypothesis? If it does, you 
will hear about it. 

BIBLIOGRAPHY 

Beardon, A. F. The Geometry of Discrete Groups. Graduate Texts in Mathematics, 91. Springer-Verlag, 
1983. ' • ^ 

Birman, M. Sh. A test for the existence of the wave operators. Doki Akad. Nauk. SSSR y 147 (1962): 
506-509. 

Cook, J. M. Convergence to the M0ller wave matrix. J. Math. Phys., 36 (1957): 82-87. 

Faddeev, L. D. The inverse problem in the quantum theory of scattering. Usp. Mat. Nauk., 14, 57 (1959); 
English translation by B. Seckler ， Math. Phys” 4(1963): 72-104. 

Faddeev, L. D. and Pavlov, B. S. Scattering theory and automorphic functions. Seminar Steklov Math. Inst. 
Uningrad, 27 (1972): 161-193. 

Fr6hlich ， J. and Spencer, T. A rigorous approach to Anderson localisation. Common Trends in Particle 
and Condensed Matter Physics. Les Houches, 1983; Phys. Rev., 103 (1984): 1-4, 9-25. 

Heisenberg, W. Die beobachtbaren Grossen in der Theorie der Elementarteilchen. Z. Physik, 120 (1943): 
1513-538; 11,673-702. 

Jauch, J. M. Theory of the scattering operator. Helv. Phys. Acta, 31 (1958): 127—158. 

Kato, T. Wave operators and unitary equivalence/ 尸 ac(/5c / Math” IS (1965): 171-180. 

Kato, T. Perturbation of continuous spectra by trace class operators. Proc. Jap. Acacl,, 33(1957): 260-264. 

Kato, T. Perturbation Theory for Linear Operators. Grundlehren der Math. Wiss in Einzeldarstellung, 
132. Springer, Verlag, 1966. 

-- TOiroda7S.^ ： OFaffieoremof^W^^^ Proc ： TdpTA6ad ：；^ ■蚱 I M57- - 

Kuroda, S. T. On the existence and the unitary property of the scattering operator. Nuovo Cimento, 12 
(1959): 431-454. 

Lax, P. D. and Phillips, R. S. Scattering Theory. Academic Press, New York, 1967. 

Lax, P. D. and Phillips, R. S. Scattering Theory for Automorphic Functions. Ann. Math. Studies, Princeton 
University Press, Princeton, 1976. 

Lax, P. D. and Phillips, R. S. Translation representation for automorphic solutions of the wave equation in 
non-Euc!idean spaces. CPAM, 37 (1984): 303 - 328,780-813. 




Lax, P. D. and Phillips, R. S. Translation representation for automorphic solutions of the wave equation in 
non-Euclidean cases; the case of finite volume. Trans. AMS, 289 (1985): 715-735. 

M0ller, C. Genera] properties of the characteristic matrix in the theory of elementary particles. Kgl. Dansk. 
Videnskab ， Selskab, Mat.-fys. Meclel., 22, i (1945); 23, 10 (1946). 

von Neumann, J. Characterisierung des Spectrums eines Integraloperators. Actualites Sci. Jnd. t 229 
...... (1935): 38-55, • 

Phillips, R. S. and Samak, P. Perturbation theory for the Laplacean on automorphic functions. J. AMS, 5 
(1992): 1-3. 

Reed, M. and Simon, B. Scattering Theory. Academic Press, New York, 1979. 

Rellich, F. StSrungstheorie der Spectralzerlegung. Math. Ail, 113 (1937): 600-619,677-685; 116(1939): 
555-570; 117 (1940): 356-382; 118 (1942): 462-484. 

• 一 RxisenblnTn,~M.*Pertnrbation of the continuous spectram and unitary equivalence. Pacific J. Math.. 1 
(1957): 997-1010. „ 

Shatten, R. A Theory of Cross Spaces. Ann. Math. Studies, 26. Princeton University Press, Princeton, 
1950. 

Weyl, H. fiber beschrankte quadratische Formen deren Differenz vollstatig ist. Rend. Circ. Palermo, 27 
(1909): 373-392. 

Wolpert, S. A. Disappearance of cusp forms in special families. i4/i. Math., (2), 139 (1994): 239-291. 





38 

A THEOREM OF BEURLING 


38.1 THE HARDY SPACE 

In this chapter we study the space of square integrable analytic functions in relation 
to the algebra of bounded analytic function. 

The Hilbert space £ 2 (Z+) consists of vectors x == (ao, a!，•••)，~ complex num¬ 
bers such that 

W 2 = [|o/j 2 < oo. (1) 

This space can be represented as a space of analytic functions f(z) in the unit disk: 
oo 

f(z) = J2a n z n . (2) 

0 

In this representation the space is called Hardy space and is denoted as H+. We have 
already come across this space in chapter 27, section 27.2. 

The L 2 -norm of / on any circle of radius /* < 1 is 

f \f{re w )\ 2 dQ = \a n \ 2 r 2n , 而 = 芸 . (3) 

We^efine … . . . 

ll/ll 2 = sup / \f(re w )\ 2 ct9 (30 

r .<l J 

.as.the_npm in H+, isometric with (1). The difference f(re id ) — f{se w ) has norm 

\\f(re id ) - f(se i6 )\\ 2 = |a„| 2 |r n - j n | 2 ， 

which shows that f(re id ) converges as r -» 1 in the L 2 sense. This limit is the 
boundary value of the function f(z) on the unit circle, 

= (4) 

0 
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to N\ for if not, there would be a 
、， such a g would belong to z k N 
der at z = 0. This is impossible, 


W = M©zMe …. (12) 

s M. Let m be any function in M; it follows from (10') 
k > 1-, and so in particular, to z k m. , 

e Wk \m(e w )|V(9 = 0, 々 = 1,2，... (13) 

conclude that (13) holds forA: = —1 ， 一 2, ...as well. 




functions 


aiy cumpicA consiani, pm is constant; aiviaing Dy mm, we con- 
onstant, that is, that p and m are proportional. 

)in M to have |p| = 1; then all functions in M are constant 
ing this into (12) shows that every function f in N can be decom- 

=aop + za\p-jr - •) = pg. (14) 

， \fi^ i6 )\ = I 汾 ’ 0 )|; since / belongs to if+, so does g. Thus 
epresentation (8) of Beurling’s theorem. □ 
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Beurling has shown how to use theorem 2 to factor bounded analytic functions: 
Theorem 3. Evety function b in B can be factored essentially uniquely as 

b = pu, (15) 


where p and u are bounded, analytic functions, \p(e ld ) = 1, and uH is dense in H. 



















The simplicity of the criterion for division by an outer function shows that they 

may be called quasi-units of the algebra S. ____ 

We turn how to divisibility by inner functions: 

Theorem 5. An inner function p divides a function b in B iff pH+ contains bH+. 













cording to definition 
is dense in H 七 , 


Theorem 7. Let c, d and e be three functions in B, e relatively prime to both c and cl. 
Then e is relatively prime to their product be. 

Proof. By definition of relatively prime, eH+ -f cH+ and eH+ + dH+ are dense 
in H— Then d{eH^. + cH^.) is dense in dH+\ therefore eH+ + d(eH+ + cH +)= 
eHjf. -f- dcH+ is dense in H+. This proves that e and dc are relatively prime. □ 

These results on divisibility in the algebra B can be used to develop a theory of 
primes in B: 

Theorem 8. Let u be any point in the unit disk: \u\ < 1. The function 
z 一 u 

p(z) = - - r (19) 

uz — 1 

is a prime inner function of the algebra B, 

Proof. An elementary calculation shows that |/?(z)| = 1 for |z| = 1, so p given 
by (19) is an inner function. To see that it is a prime, we note that since p vanishes 
at 4 = so does every function mN = pH+. Conversely, if / in vanishes at 
z = «, then f/p belongs to H + , so / belongs to N = pH— This shows that pH+ 
consists exactly of those functions in H+ that vanish at u. It follows that N has 
codimension 1 in H+. This implies that p is a prime. To see this, suppose that q is 
an inner function that divides p\ according to theorem 5, then qH+ contains pH+. 
Since pH+ has codimension 1, qH+ is either pH^ or H+; in either case q is, by 
theorem 3, a trivial factor of p. □ 


We will see a little later that the functions (19) are the only primes in B. However, 
B contains in addition prime powers. These are technically easier to analyze if we 
switch-from the unit disk to the- upper half-plane. The transformation 


( 20 ) 


maps the upper half-plane Imiy > 0 onto the unit disk fi] <1. The relation 


客 ⑽ =/ 


(w - i 、 


(200 


assigns a bounded analytic function g in the upper half-plane to every bounded an¬ 
alytic function / in the unit disk, and conversely. This relation is an isometric iso¬ 
morphism of these two normed algebras. We will use the same symbol B for either 
of these two isomorphic algebras. 


•6 
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Theorem 9. The function - 

- - - - : -~•… • _ ( 21 ) 

is an inner function. Fw^thermore it is a prime power in the sense that its only fac¬ 
torization by inner functions is 

沪 1 V，.. a + b^\. _ _ ； (22) 

Proof. Obviously |/7 (iu) I < Ffor Imiu > 0, = 1 for Im w = 6. Suppose that 
there were a nontrivial factorization 

— e,U< ;g( 土 5 .⑽， (22 7 ) 

p and q inner functions. Denote the real and imaginary parts of w as x + /v. Take 
the log of the absolute value of (22 f ): 

-j = log|/7|+log|^|. (22 〃） 

We define 

^ = -log|p| (23) 

and claim: 

(i) h is harmonic inlhe upper half-plane. 

(ii) /? is positive in the upper half-plane and is continuous down to the boundary 
y = 0, where it is equal to zero. 

For it follows from (22’）that p and q have no zeros in the upper half-plane; this 
shows that h, the real part of the analytic function log p, is harmonic as asserted 
in (i). 

Since both p and q are nontrivial inner functions, \p\ and \q\ are < 1, and so 
log |p I and log | ^O in the upper half^plane ； The^ inequalities with 

(22") give 

0 < h(x,y) < y. 

The assertions (ii) follow immediately from these inequalities. 

A harmonic function that vanishes on a straight line boundary can be continued 

by reflection across that boundary as a regular harmonic function. Thus h can be 

extended to the whole plane by setting 

-v) = -h{x,y). 

We apply now the Poisson formula, see equation (29’）in chapter 11, to A, replacing 
the unit disk by a disk of radius R : 

呛參 r …二 :( 二 )+ ， 騎 ， ( 24 ) 
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where (jc, y) = r(cos0, sin 9) and 

k((/) i R)=h(Rcos^ 1 R sirup). (240 

Since h is an odd function of y, k (中 + 7r, /?) = 一 / :(0, R), so we can rewrite (24) as 

h(x,y)= ^ Q{R,r r e t (l>)k{<l>,K)d(f>, 

Jo (25) 

Q = p(R，ne — ci>)-PiR，ne + (i>), 



ratio of the integrals also tends to 1. Since the left side is independent of it follows 
that - 

Kx\,y\) =21 

Kx^yi) ^ yi 

- Ihis>pcaves-thaUL(x r ^)-s^^aj^C)L3y4^)^l-/iC^k^^^a^ 

.... r __.similari.y^X^).^^S.,^ _. □ 

The analytic mapping iy (c - u;) -1 , c any real number carries the upper half¬ 
plane into itself. It follows therefore from (21) that the inner functions 

.. pW = exp{f(c r wr 1 } (29) 

are prime powers in the sense of theorem 9. 
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Next we will show that the functions (21) and (29) are the only prime powers 
by completing the discussion of-faGtGrizat4on4n*T8r~W©~F6£Feat--t0-the~tinit-disk--a!id.. 
consider any bounded analytic function b(zh Denote by luj] the set of zeros of b 
with multiplicity m j 9 and by pj the corresponding primes of form (19) except for a 
normalization that makes them real and positive at z = 0: 


PjXzr^ 


..JkjL 


Uj UjZ-： 


T30T*" 


We appeal now to the analogous discussion in chapter 9, section 9.2, that shows.that 
the infinite product, called a Blaschke product, 

pu)= ⑵' 


(31) 
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It follows from the representation (34) that, as claimed earlier, all primes are of 
the form 


u — z 

Rz-V 

and that all prime powers are of form 

v + z 

exp - ， 

v — z 

We conclude this section by remarking that parts of this theory, in particular, 

Beurling’s theorem, can be carried over to vector-valued analytic functions that are 

square integrable ， respectively, bounded in the unit disk or upper half-plane; see 
Halmos. The inner factors are operator valued analytic functions, unitary a.e. on the 
boundary, and < 1 in norm in the upper half-plane. In the language of chapter 37, 
section 37.7, scattering matrices are operator valued inner functions. 


M < 1 ， 

M = l. 


38.3 THE TITCHMARSH CONVOLUTION THEOREM 


We give now an application of theorem 9; see Lax. We consider L 1 functions on the 
positive axis ^ > 0. We denote the lower end of the support of such a function F by 
“.That is, 


If = max{??: F(^) = Oforf < r?} (35) 

An important theorem of Titchmarsh asserts: 


Theorem 10. Let A and B be two L 1 functions on R+, A * J5 their convolution: 

.(A*S)(|)= P (36) 

Jo 

Then 


- (-37-)-. 

Proof. For 与 < l a^-^b the integrand on the right in (36) is zero, since at least one 
of the two factors is zero. Therefore the integral is zero, that is, (A * B){^) = 0 for 

+ 名 b. Tliis pruwslirartTe-teft-Fkte-offSI^-is-srtheTighrstdBrwhatTemaiiTS" - 

to be- shown is^thabequalityhoIdsrTo'do^this-we-rely-on-a characterization due to. . . 

Paley and. Wiener pf the lower end of the support of an L 1 (R+) function F in terms 
of its Fourier transform 

/(iu)= r F^)e^ w d^. (38) 

. JO- . . . 

Clearly, f(w) is a bounded analytic function in the upper half-plane. Next we need 




Theorem 11 (Paley-Wiener). The function F in L 1 (1^^) vanishes oh the interval 
[0 , 幻 iff its Fourier transform f(w) is < const. e^ Jn absolute value in the upper 
half-plane Im u; = }> > 0. In the language of (35) we can express this as follows: 

= max{£: \ f(w)e^ Uw \ < const. }• . 

Proo f- Suppose that F vanishes on the interval [0, l]\ then we can write its Fourier 
transform (38) by setting f = a + € as 

poo / *oo 

f ⑽=九 F(^)e^ w d^e Uw F(a + i) e i<7W da. 

Clearly, f(w)e^~ l ^ w is bounded-in the upper haIf-planereonverselj^~suppose thfaT' 
f ( w )^ 1 w is bounded in the upper half-pfane; we claim that F, the Fourier inverse 
of /， vanishes on [0, £]. We want to prove this by showing that for any smooth 
function G(f) supported on [0, i — d], d any positive number, the scalar product 
(F, G) = 0. Denote the Fourier transform of G by g: 

pt—d 

gM = G(M)e^ x d^. (39) 

According to ParsevaFs formula 


where 


From (39) we have 



(40) 

(400 


(390 


This formula shows that|(x) can be extended as an entire function h(w) to the whole 
complex plane 


nt—d 一 

h(w)= / G^)e~^ w dl. 

Jo 

Sgc，? is smooth, h(w) is bounded in the upper half-plane Imiu = )，> 0 by 
e 一、 y I (1 + \w\ 2 ). Since / is analytic in the upper half-plane, we can by Cauchy’s 

fhp.nr^m chift lino. r\-f /At\/\ c _ 丄 1__ i . 1 . -r 


卜 0，)ACx + /〕 
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By construction, 

e (e-d)y 

l/tCx + iy)l<-— T — Jt 0 

1 + x 1 + y l 

and by hypothesis, 

\f(x -f- iy)\ < const. (- 

Using the estimates (42), (43) on the right in (41) shows that the right side of ( 
tends to zero as y tends to oo. Since the left side is independent of it is zero 
follows from Parseval’s formula (40) that (F, G) = 0 for all smooth functions 
supported on some subinterval of [0, £]. This proves that F vanishes on [0, i]. 

We can restate the Palev-Wiener theorem in the language of division in the al 
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yhe mathematician leading the British code breaker was the great logician Alan 
Turing. After the war he was prosecuted for homosexuality and hounded into suicide. 
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A 

RIESZ-KAKUTANI 
REPRESENTATION THEOREM 


There is a conundrum in mathematical analysis similar to the chicken or the egg 
question: Which comes first, the Lebesgue integral or the Lebesgue measure? My 
answer is, neither; first comes the space L 1 . The traditional approaches enlarge the 
class of continuous functions, and then show this class to be sufficiently large, that 
is, complete in the L 1 -norm. The approach described in the pages to follow puts the 
horse before the cart. The object of our desire, L l , is defined as an abstract space，the 
completion in the L ! -norm of the space of continuous functions. Then each element 
of L x is identified as a down-to-earth function, defined almost everywhere. 

The theorem in the title states that every bounded, linear functional l on the space 
C{0) of continuous functions con a compact Hausdorff space Q can be represented 
as an integral with respect to a signed finite measure on the cr-algebra of the Borel 
sets of Q: 

£(c) = J cdnt. 

In this note we use functional analysis to give a simple and natural proof of this basic 
proposition in the case where Q is a compact metric space. 

A.1 POSITIVE LINEAR FUNCTIONALS 

We will study bounded linear functionals i on the linear space C(Q) of continuous 
- real-valued-functions c on- a-eompaet metric-spacev Boundedness means that- -- 

«(c) < const. |c|max 

for all continuous functions c. A linear functional is called positive if £(c) > 0 for all 
nonnegative functions c. Note that a positive linear functional is monotone: c\ < Co 
implies that i(c\) < £(C 2 ). It follows that a positive linear functional is bounded. 
This is because every c in C(Q) satisfies c < |c| max «，where u is the unit function 
on Q : u(q) = 1 for all q\nQ\ therefore, by monotonicity, £(c) < |c| max 2(«). 
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It is a standard result, and not hard to show, that every bounded linear functional 
on C(<2) can be decomposed as the difference of two positive linear functionals. 
― Therefore it suffices to prove the representation theorem for positive linear function- 
—ikliOJfek 立码 e fte; • 即炫 跋 pting irieasure.is positive. 

Given a positive linear functional l on C(Q) t we define the €-norm of a continu¬ 
ous functions as follows:. 


where |c| is the absolute value of the function c : \c\{q) = |c(^)| for every point q 
of Q. x ' 

The quantity \c\^ is a semi-norm on the space C{Q). If we identify the continuous 

- functions whose difference has fnorm 0, we geTa genuine nomfon the'qu~otient' 

space. We denote by L the completion in ; fhe £-norm of this quotient space. We 
recall that the elements of the completion are Cauchy sequences in the £-norm of 
continuous functions; two Cauchy sequences are lumped together if their difference 
is a null sequence in the f norm. 

It follows from definition (1) of the ^-norm and monotonicity that |£(c)| < \c\^.li 
follows that the functional l can be extended by continuity to the whole space L. If 
(c„} is a Cauchy sequence tending to / in the £-norm, we define \imt{c n ) = €(/); 

^ is a bounded linear functional on L: 

. . m < !/u. (2) 

Next we show that the elements of L have some functionlike attributes. 

Theorem 1. Let <p be a Lipschitz continuous function ffi -> M; 

- <P(y)\ < k\x - y\. (3) 

Let f be any element of L; then can be defined as an element of L. Further¬ 
more, for any pair of elements f and gofL t 

Proof. Let [c n ] be a Cauchy sequence in the €-norm of continuous functions; 
denote their limit in L by/.It follows from the definition of the £-norm, the mono¬ 
tonicity of t, and (3), that 

I 沴 (c") 一 (j){c m )\i < k\c n - c m \ e . 

Thus {0(c, 7 )) too is a Cauchy sequence; denote its limit in L by the symbol 
Concerning the second part: let [d n } be a sequence of continuous functions tending 
in the £-norm to g. By (3), 


10(A) — 0(4)1 <k\c n -d n \ 





£(|^(C W ) - 0(4)1) <^(|C«-41 )； 

by definition of the €-norm，this can be expressed as 

\<P(c n ) - 4>{d n )\i <k\c n - d n \e. 

As n tends to 00 , the left and right sides tend to the left and rights of (4). □ 

The usual rules of functional calculus hold: Here are two important examples: 
Example 1. 


沴 +W = 


for.r < 0 
forO < x. 


We denote 沴 +(/) as /+， and call it the positive part of /. 


Definition L / in L is called nonnegative, denoted as / > 0, if /+ = /. 

Note that / in L is nonnegative iff it is the limit in the 名 -norm of a sequence of 
nonnegative continuous functions. It follows that if / > 0, \ f\i = £(/). Note that if 
/ > 0, ^ > 0, then f-bg>0. 

We say that f < gif g — f > 0; clearly this relation is transitive in L. 

Example 2. 

I a for x < a 
^ ioxa<x <b 
b for b <x 

for / in L we denote 0^(/) as fa- : 

Definition 2. / in L is said to be bounded if / = for some a and fc. 

Note that / in L is bounded iff it is the limit in the 名 -norm of a sequence of 
"■bontidtd i；ou lin uOu5 fuiictioiis. •" 

— r .-—Note-that if - / in iris:bounded; and 於 -is-Lipschitz continuous on the interval [a, b], 
then 0(/) can be defined as an element of L. 

If </>(x,y) is a Lipschitz continuous function of two variables, then can 

be defined for any pair of elements /, g in L. We will use only a special case of this. 
Suppose that / and g are elements of L that are both bounded. Then we can define 
their product fg as an element-of L that is bounded, as follows: 

Let (c, z ) and {d n } be Cauchy sequences of bounded continuous functions tending 
in the《-norm to / and g 9 respectively. It is easy to show that {c n d n } is a Cauchy 
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sequence in the i-norm; its limit in £ is defined as ffie product 

/ > 0, g > 0, the fg > 0. 

The following result, although simple, is very useful. 

Theorem 2 (Monotone Convergence Theorem). Let [fn] be a monotone se¬ 
quence of elements in L, say increasing, f„ < f n+] . Suppose that the sequence 
of numbers t{fn) is bounded; then f n converges in the l-norm to a limit f in L. 

Since the functional J is monotone, i{f n ) is an increasing sequence of real 
numbers. Being bounded, it has a limit. We claim that [f n ] is Cauchy sequence, for, 
since f n - f m is nonnegative for /2 > m, 

- Ufn) - D. 

Since t(f n ) is a convergent sequence, the right side tends to zero as n, m tend to oo. 

Having shown that {/«} is a Cauchy sequence, it follows the completeness of L 
that f n converges to a limit in L. □ 

Theorem 3. For any f in L 

l - limit /* =： /. 


Proof. Let {c n } be a Cauchy sequence of continuous functions whose 名 -limit is /. 
It follows that for any € > 0 there is an N such that " 

\f-CNk < (5) 

Since c N is a continuous function on a compact space, it is bounded: a < c N {q) < b 
for all q in g, for any a and b exceeding the upper and lower bounds of cn. Define 
as in example 2; by inequality (4), 

賦 U) - 被 (CN)\e < \f -c^k < 

Since the values of c" lie in the interval [a, b], 4 >q{cn) = cyv, so we can rewrite the 
inequality above as 

\fa - c n\i < e. (6) 

Using the triangle inequality and the estimates (5) and (6), we get 

If-fait = \f-c N + c N - f^\c < 1/ - cn\i + |cyv - f^\c < 2e. □ 


A.2 VOLUME 


In this section we show how to use the positive functional t to define the volume of 
any open set G in Q. 



Definition 3. A continuous function c is called admissible for the open set G if 

(i) the support of c is contained in G. 

(ii) c{q) < 1 for all 9 in Q. 

The volume V (G) with respect to i of an open set G is defined as 

V(G) = sup e(c) (7) 

c admissible 


Theorem 4. 


(i) The volume of the empty set is zero, 

(ii) V is a monotonic set function: ifG C H f then V(G) < V{H). 

(Hi) V is countably subadditive: V{l)^°G n ) < V(G n ). 

(iv) V is countably additive, that is, ifG n is a collection of pairwise disjoint open 
sets, 

V(UG„) = Y, V( <Gn). 

Proof. Parts (i) and (ii) are obvious. To prove (iii), take any continuous function 
c admissible for UG n . The support of c is a closed set; since O is compact, so is the 
support of c. Being admissible, it is covered by UG n ; therefore by compactness it is 
covered by a finite subcollection U^G n . We show now that c can be decomposed as 

N 

c = J2 Cl1 ， ⑻ 

1 

where c n is admissible for G, u n = I ， … 、 N. (iii) would follow from this; for, 
applying i gives 



i{c) = J2i(c n y, 


since by definition of volume g(c») < V(G»), 


«(c) < 2 V{Gn). 

1 

Taking the supremum of over all admissible c yields (iii). 

…- We-tum now to the decomposition ( 8 ). Each 分 in the support of c belongs to at 
least one of the sets G n , h = , N. There exists a continuous function b q with 

the following properties: 
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(i) bq is nonnegative. 

(ii) b g (q) > 0. 

(iii) The support of bq lies in one of the sets G„. 

The set where b q is positive is an open set O q which contains q\ the union of these 
open s 邮 Oq contains the support ofe. Since the support of c is compact, it is covered 
by_S^finite, number.of these open sets. The sum of the corresponding functions bq is 
—— Bpsitiyeonjhesupportofc;___....._ .. . ......\ , 

y~! bq, >0 on supp c. 

Assign to each b qi an open set-€v-that-eentains-its support：-Denote by b„ the sum 
of those b q . assigned to G„. Clearly, Ef -E^y,；now define 

b n 

c n = f c ， n = I ， … ， N. 

Clearly, ^ c n = c, and each c n is admissible for G n . This completes the construction 
of the decomposition (8). 

(iv) Given any N, we can choose admissible functions c n for G n , n < N, so that 
y(G„)-±< e (Cn ). 

Since the G„ are pairwise disjoint, c n is admissible for uf and therefore 

N \ 

E c ") ^ v (ufG„) 

Using the linearity of € and the inequalities above, we get 

Letting oo, we get 

E V(G n ) < V(UG n ). 

By countable subadditivity the opposite inequality hold; so equality must prevail. 

□ 

The following simple estimate is useful: 

Theorem 5. Let h be a continuous nonnegative function defined on Q，a some non¬ 
negative number, G a the open set G a = [q : h(q) > a). Then 






L AS A SPACE OF FUNCTIONS . S3S 

v(G a ) < -m. ⑼ 

a 

Proof. We show that any continuous function c a admissible for G a satisfies for 
every q, 

c a (q) < -Kq). 

a 

Clearly, this is true for q not in G a , for then c a (q) = 0, and h{q) > 0. It is equally 
true for q in G^, for then by definiton h{q)ja is greater than 1 ， while c a {q) < 1 . 
Since i is monotonic, 


uc a ) < -m. 

a 

Taking the supremum of this inequality over all admissible functions c a yields in¬ 
equality (9). 口 

Definition 4. A subset 5 of Q is called negligible if it can be covered by an open set 
of arbitrarilysmall volume. 

It follows from the countable subadditivity of volume [see part (iii) of theorem 4] 
that the countable union of negligible sets is negligible. 

When a relation holds for all points of q with the exception of a negligible set，we 
say it holds almost everywhere, abbreviated as a.e. 


A3 L AS A SPACE OF FUNCTIONS 

In this section we show how to associate to any element / in L a function f{q) on 
Q, up to negligible sets of points. 、 

Definitions, A Cauchy sequence {c„} is called rapidly converging if 

for all n and some constant k. 

Note that every Cauchy sequence contains rapidly convergent subsequences. 
Clearly, a sequence satisfying (10) is a Cauchy sequence. 

Theorem 6. A rapidly converging Cauchy sequence {c 7l } of continuous functions 
converges almost everywhere, that is except fora negligible subset of Q. 
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Proof. Denote by G n the open set of points satisfying - 

__ °n = |g .:上 , (Cl) 二 C„+i {Cj)\ > J • ( 11 ) 

Combining inequality (9) for h = |c„ - c„ +I | and « = 1 /；i 2 and (10), we have 

一—一 …一一 —一―一代 0 疗々2^穹上^制今:一： (12) 
一 —— . . ■■■— 一- - - 11 二 . . - -- - - 

It follows from (11) that for q not in G, u \c n {q) - c n+ \{q)\ < I//7 2 ; therefore the 
sequence c n (q), written as a sum 


c n = -Q-j), 

1 

converges at every point q that belongs to only a finite number of the sets G n . The 
exceptional set of points, where [c n (q)\ fails to converge, can thus be enclosed in the 
union U^G n . By countable subadditivity, the volume of this set is bounded by 

00 

V(U^G n )<J2V(G n ) 

N 

- which according to (12)-is < and thus tends to zero as N tends to 00. 

Thus the exceptional set can be covered by open sets of arbitrarily small volume, and 
therefore form a negligible set. 口 

It is important to note that the sequence c n (q) converges uniformly outside the 
open set U^G n ,N arbitrary. ^ 

Given any element / in L, take any Cauchy sequence {c/j} of continuous functions 
that converges to /. It is easy to see we can select a subsequence that converges 
rapidly;, the pointwise limit of this sequence is called a realization of /. 

We claim that realizations obtained from any two Cauchy sequences converging 
rapidly to / are equal a.e” for, we can select a single rapidly convergent Cauchy 
sequence that contains infinitely many terms from both sequences. 

The following theorem characterizes the relation of / in L to its realization. 

Theorem 7. 

(0 For any Lipschitz continuous 0 , (j){f){q) = 0 (/( 分 ))• 

⑻ （/ 士 g){q) = f(q) ± g(q). 

( 出 ) If f and g are bounded elements of L, (fg)(q) = f{q)g{q). 

( iv ) The realization of elements of L by functions is faithful, that is, if f(q )= 
then f = g in L. 

(V) Suppose that i-limit f n = / and that f n (q) converges a.e. Then the point- 
wis ^ limit of f n (q) is a realization of the i-limit /. 







(i), (ii), and (iii) are obvious. To prove (iv), it suffices to show that 
presented by a function that is zero a.e., then / = 0 in L. We argue 
posing that / ^ 0. So we may assume that f is nonnegative. Since 














M3 




using ims inequainy ana^iJ ; comomea wan {/.), w 

«(/) = «(/ D + l{c N ) S 1/ - 袖 + (1. 
一 S'_ 一 /Iraicjffiiegative elemenrof Lv £(/)~= [fte'f 
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Theorem 8. Let f be an element of L whose fedlization f(q) is > 0 a.e. Then f is 
>0 as an element o f L. ... : •… 


If 7(^j'">'0 a ： e：,' 

it follows that f+(q) = f(q) a.e. It follows then from part (iv) of theorem 7 that 
/+==/• but this is the definition of / > 0. - —- - Q - 


Corollary. Two ele— 屯 丄听 4 g in. L satisfy 丄 q) _ 

Proof. The relation f < g means that g — f + p, p nonnegative. Now apply 
theorem 8 to p. 口 


A.4 MEASURABLE SETS AND MEASURE 

Definition 6. A set S in O is measurable if its characteristic function 


csig ) : 


1 when ^ in 5* 

0 when q not in S 


is the realization of some element fs in L. The measure of S is defined to be 


(16) 


(17) 


Note that the notion of measurability, and the value of measure, depends on the 
linear functional l. 

We show first that this notion is not vacuous. 


Theorem 9. Evety open set is measurable, and its measure is its volume. 

Proof. We will use the function G c ), the distance of ^ from the complement 
of G. Set 

f 0 for 文 5 l/2/i 
<j> n (x) = I linear in between 
l 1 for \/n < x. 

Define the continuous functions c n {q) = <j) n {d(q, G c )). Clearly, c„ is an increasing 
sequence of functions, and equally clearly, £(c„) is bounded by €(w). Therefore, by 
theorem 2, the monotone convergence theorem, the sequence {c„) converges in the 
£-norm to a limit which we denote as fc ，On the other hand, [c n {q)} converges for 
every q to the characteristic function CQiq) of G. According to part (v) of theorem 7, 
^g(^) is the realization of fc\ this proves that G is measurable. 

To determine the measure of G, we note that since c n is ^-convergent to /c, £(c n ) 
tends to £(/g). Since each c n is admissible for G, £(c n ) < V(G), and so £(fc)= 
lim £(c n ) < V(G). On the other hand, given any admissible c, we can choose n so 









large that c < c n . Since V(G) = sup£(c), it follows that 1(fc) = Vmi(c n ) > 
V (G). Combining the two inequalities shows that i(fc) = V (G). □ 


Theorem 10. The collection of sets measurable in the sense of (16) is a a-algebra, 
and the set function m(S) defined by (17) is a measure. That is, 

(i) the complement of a measurable set is measurable, 

(ii) the intersection of two measurable sets is measurable. 

(iii) the denumerable union of measurable sets is measurable. 

(iv) m(S) is a countably additive set function. 

Proof, (i) S is measurable if its characteristic function cs is the realization of an 
element of fs in L. But then the realization of u — fs is 1 — cs, the characteristic 
function of the complement of S. 

(ii) If fs 、and fs 2 are elements in L whose realizations are the characteristic func¬ 
tions of S\ and S 2 , then the realization of the product fs^fs 2 ls the characteristic 
function of S\ fl 52- 

(iii) Let {5 n } be a denumerable collection of measurable sets. We can replace 
this collection by another one, {T n }, where T n = S n H (Sj U . .. U S n -i) c , Clearly, 

Ti U ... U = 5i U ... U and therefore UT n = U5 ；I . The sets T n are pairwise 
disjoint. Consider now the series YlT /r„; since each fr n is nonnegative, the partial 
sums E^/r„ of this series are increasing. Furthermore, since the T n are pairwise 
disjoint, h n < therefore h n ) < ^(«). We appeal now to the monotone 
convergence theorem to conclude that the partial sums converge in the £-norm to a 
limit that we denote by fy ： 

00 

X] fTn = fT ( 18 ) 

1 

On the other hand, the realization of the partial sums is ct„ ( 分 )， which converge 
pointwise to cx(q), where T = UT n . By part (v) of theorem 7, cj is the realization 
of fj\ therefore T = \JT n = U5 /2 is a measurable set. 

(iv) Since the partial sums Of (i8) eonverprimhetqffOmrfO'/fTWeMlpply - 

i to obtain ^ ^(/r) - Using the definition (17) on both sides, we obtain 

Z n { m(T n )=m(T). . □ 


Theorem ll. A realization f{q) of any f in L is~d measurable fiinctioh^ihat Ts, the 



K a = {q ' f(q)>a] (19) 

a any real number, are measurable sets. 


Proof. We may assume that a is positive，for we can always add a constant to /. 
We may further assume that a = 1, for we can always divide / by a. Recall that /J 
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denotes the truncation of / from bdow by 0/and from ^ by 1. The sequence 
fn = (/ 0 ') n is decreasing, since /„ - f n ^ - /j) is the product of two 

nonnegative elements of L : Sinceeach'7 ； 7Ts"rionnegative, afllhT£(7)0lxe bounded 

from- below by zero.-Se by-tt)e-moRotone-convergenee-theoremrthe~sequence-//i - 

converges in the £-norm to a limit. On the other hand, the sequence of realizations 

converges to 1 at all points q where f(q) > 1 , and-toOatall points-where / ( 殳 ） < 1 . - 

By the part (v) of theorem 7, this shows that the characteristic function of the set K\ 
defined by (19) is a realization of the Wimit of (/ 0 】 )”. □ 

Replacing / by —/， and taking complementsrwe can' show tharall seTs'of the 
form , 

: /⑻ < a }，{q : f(q) < a], {q : f(q) > a) 

are measurable as well. 

We are ready to show that the measure m we have constructed in (17) yields the 
linear functional €， 

£(c) = / cdm ( 20 ) 

for all continuous functions c. To see this, we form the sets Kj t( : 

K j,e = {q : je < c(q) < (y + l)e}. (21) 

Denote the characteristic function of K j>e by kj， e . It follows from (21) that 

Y, ⑷ S + ( 22 ) 

holds at each point q. Since c is bounded, these sums are finite. According to the 
corollary to theorem 8 , we deduce from ( 22 ) that 

J2 j €k J^ I]0' + i)ekj， e (23) 

holds in L. Since ^ is a positive functional, we deduce that 

J2j^(kj, e )<t(c)< J2U + i)€C(k j<e ). 

Using the definition (17) of measure for the sets we can rewrite this as 

E < t{c) < + l)€m(k he ). (24) 

The left and right side in (24) are lower and upper sums for the integral / cdm in 
(20). Their difference, 
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6 ^m(^) = €m(Ukj t€ ) = €m(Q), 

tends to zero as e does. Thus (24) shows that t{c) is the only number > all lower 
sums and < all upper sums. This proves the representation formula (20). □ 

We remark that the representation formula (20) holds as well for all / in L. In this 

case the sums in (23) may be infinite. Taking / to be nonnegative, the convergence 

of the infinite series in (23) follows from the monotone convergence theorem. Since 
any f in L can be written as the difference of two nonnegative elements, the general 
case follows. 

We close this section with the converse of theorem 11. 

Theorem 12. Every function g(q) defined on Q that is measurable and integrable 
is the realizaiton of an element g of L. 

Proof. Measurability means that the sets H a = [q ： g(q) < a] are measurable. 
Define the function n g , M R, by 

n 8 {a) 

The function g is called integrable if 

J \a\dn g < oo. 

Define g € in L by 

8e = Yjj ek J' e ' 

where kj t€ is defined after equation (21). As 6 tends to zero, the realizations g € (q) 
tend to is^easy to show that if we set e = 2 一 we get a Cauchy sequence in 

the 名 -norm, whose limit g is realized by g(q). □ 

The uniqueness of the representing measure is a standard fact of measure theory. 


A.5 THE LEBESGUE MEASURE AND INTEGRAL 


If we take 2 to be a Euclidean multitorus, and £ to be the Riemann integral, our con- 
struction gives the Lebesgue'measure arid intdghition.TSSilHSiTTfiftli 玩 ) Toacff fiioFe 
- natural than the traditional ones; for the most important object in the Lebesgue the¬ 
ory is the complete space L 1 (as well as the spaces L p ). In the traditional approaches 
the completeness of L 1 is the last item to emerge; in the present approach it is the 


first. 









THEORY OF DISTRIBUTIONS 


In his formulation of quantum mechanics Dirac treated the continuous spectrum by 
employing a function on E, denoted as 8 — and ever since called Dirac’s delta func¬ 
tion 一 that is zero everywhere except at x = 0, where it is so large that the integral 
of 5 over M equals 1. Of course there is no such function. Von Neumann, in his book 
on quantum mechanics (1932), warns against basing a theory on such a fiction; he 
knew very well how to treat the continuous spectrum rigorously. 

The 5-function can be given new life as a generalized function. The need for such 
generalized functions was keenly felt in the 1920s and 1930s. Bochner introduced — 
rigorously — such notions in the context of the Fourier transform, and Sobolev in the 
context of partial differential equation. Hadamard’s use of the “finite part” of an in¬ 
tegral in his formula for solutions of hyperbolic equations foreshadowed the need 
for generalized functions, as did Wiener’s justification of the Heavyside calculus. 
L. C. Young’s idea of “generalized curves” is a step in a similar direction. But it was 
Laurent Schwartz, in the 1940s, who came up with the notion of distributions that 
was general enough and supple enough to serve most purposes of both the theory of 
partial differential equations and harmonic analysis. Harald Bohr, a leading mathe¬ 
matician of his time (and brother of Niels Bohr), was among the first to recognize the 
value of Schwartz’s ideas. The world soon followed; at the International Congress of 
Mathematician in 1950 Laurent Schwartz was honored by a Fields Medal'; pockets 
of resistance eventually faded._ 

This appendix presents the bare bones of the theory of distributions, with a few 
scraps of meat thrown in. For a fuller presentation I recommend Robert Strichartz’s 
book on the subject. 


B.l DEFINITIONS AND EXAMPLES 

We denote by Cq° the space of all complex-valued infinitely differentiable functions 
in W 1 of compact support. We say that a sequence {u^} of Cq° functions converges 
to u if the support of air is contained in the same compact set K, and if for each 
multi-index a = a|,..., a ni D a uf c = ... Dn n iik converges uniformly to D a u, 

where Z)/ denotes partial differentiation with respect to X[. 










THEORY OF DISTRIBUTIONS 

nition. A distribution is an element in the dual of Gq 0 , that is, a complex-valued 
linear functional t on Cq° that is continuous under sequential convergence as defined 
above: 

— t{u) if iik u. 


• This kind of continuity is equivalent to the following apparently more stringent 


Theorem 1. Given a distribution to each compact set K there is a positive integer 




sof distributions by assign- 



By the same token, distributions may be regarded as generalized functions. In 
what follows we will-often write- 


in (2) we have examples of distributions; we now give others. In each case we leave 
it to the reader to verify that the given functional depends continuously on u in the 
sense of sequential convergence. 


Example L £(u) = J uvdx, v any continuous function. 

Example 2. S(u) = w(0), the Dirac delta “function.” 

Example 3. £(u) = f (D a u)vdx, v any integrable function, a any multi-index. 
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Define t(u) as the principal value integral 


e(u) = PV 


/ 



u(x) 


clx. 


Later we will show that the most general distribution can be built up from distribu¬ 
tions of the form given in example 3. 


Definition. Let D be an open set in 肥 ； denote by Cg 3 the space of all C°° functions 
with support in D. A distribution in D is a linear function on C 它 continuous under 
sequential convergence as defined above. 


B.2 OPERATIONS ON DISTRIBUTIONS 

In this section we show how certain operations that can be carried out for ordinary 
functions can also be carried out for distributions. These operations T are the follow¬ 
ing kind: 

(i) T is a linear operator that maps Cq° continuously into Cq°. 

(ii) T has a transpose T’ that also carries Cq° continuously into Cq°. V being the 
transpose of T means that for all u, v in Cq°, 

(T’ 《， u) = («,Tv), (3) 

where (•,*) is the symmetric bilinear functional used in (2) to embed Cq° in 
its dual Note that the transpose T’ is uniquely determined by (3) and that the 
.transpose of V is T itself. 

The following rules dealing with transposes are obvious and useful: 


(i) If T and S have transposes, so do aT + bS 9 and (aT + bS) ; = aT f + bS f . 

(ii) The transpose of TS is (TS)' = S'T'. 

Theorem 2 (Extension Theorem). Let continuous linear opemUTmf: 

-piii^C^intd^C^TtmspdsWdre^ 

as the distributions given by 

(TVO = (v,TO. (4) 

Troof. "The reader c'aneaSly verify that thFleft 'side of (4) depends linearly and 
continuously on v, since continuity of Y is taken to mean that V maps every con¬ 
vergent sequence of Cq° functions into a convergent sequence of Cq° functions. □ 



























Example 10. Let (p: W 1 R n be a C°° mapping that takes compact subsets of 肥 
into compact subsets; suppose that 0 is invertible and that its inverse ^ has the same 
properties. Then the transformation T defined by (Tm)(x) = u(<p(x)) maps Cg° 
linearly and continuously into Cg°. The transpose of T is (T’w)(;y) = v (诊 (y)) J ()，)， 
where J is the Jacobian of 诊 ； T’ maps Cq° into itself, linearly and continuously. 

Applying the extension theorem to these examples 
following operations with distributions: 


we can give meaning to the 










⑻ i(u) = f u(x)\x\dx. 

(b) e(u) = PVf u(x)/xdx. 

(c) m=! 0 _ oo u{x)dx. 

(d) l{u) = 8(u) = m(0 ). 


B.3 LOCAL PROPERTIES OF DISTRIBUTIONS 


We recall that the support of a continuous function / is the closure of the set of points 
where f(x) 0. We will define the analogous concept for'distributions; our starting 



contained in Gi, thaLof «2 in G 2 ._We_pr.Qce.ed as follows,:. Since.each point x.in.the 
support of u belongs to at least one of the sets G 1 or G 2 , we construct a function h x 
with the following properties: 













(i) A ： v()0 > O for all j ： 

⑼ & is C 00 . 

(iii) IjxM > 0 . 

(iv) The support of h x lies in Gj or Gi^ 

… The set \^here~h x > O' is an open set containing x\ the union of these cover the 
...support of w. By compactness, a finite number of them cover the support of u. 

DenoieJjyAi th.e. sum.of..those ofthisiinite collection whose support lies in G-i, 
by ho those whose support lies in G^. Their sum li\ + ft 2 is positive on the support 
of u. Now set 

4i Il2 

- 可三 7 二含 m = 7—. 

h\ 4 - Ii2 、 h\ + «2 

Clearly, support wj c G 1 , support C G 2 , «1 and «2 both Cg°, and wi +«2 == «• 
Then 

(w ， i) = (Wj +«2^) = (Wi,^) + («2» €)=0 + 0, 

since i is assumed to be zero both in G\ and G 2 . This completes the proof of the 
lemma. □ 

It follows by finite induction from the lemma that if i is zero on the open sets 
G\, G 2 , ..., G tu then TTis"^© on tKeir union. We claim that the same conclusion 
holds for an infinite collection of open sets Gj. We have to show that if the support of 
w is contained in U Gj, the i{u) = 0. By compactness, the support of u is contained 
in the union of finitely many Gy, so the conclusion follows. 

The union of all open sets on which a given distribution t is zero is the largest 
open set on which i is zero. The complement of this set is defined to be the support 
oft * 

The following result follows from the definition of support: 

Theorem 4. Let i be a distribution, u a function, and suppose that the supports 
ofu andi are disjoint; then i(u) = 0. 

Exercise 4. Show that if 1 is a distribution with compact support, and w a Cq° func¬ 
tion, then £ * is a Cq° function. 

Exercise 5. Show that if i and m are two distributions and one of them has compact 
support, then t * m can be defined as a distribution. 

Exercise 6. Let / be a C 00 function, t a distribution such that fl = 0. Show that £ 
is zero on the open set where f(x) 0 . 

Exercise 7. Show that the support of the derivative of a distribution i is contained in 
the support of L 







Proof. We will use the following lemma: 


Lemma 6. Let t be a distribution whose support consists of the origin. Then there 
is an integer N such that i(u) = 0 for all Cq° functions u which, together with all 
their derivatives up to order N, are zero at the origin. 


Proof. Let / be a C°° function with these properties: 



for \x\ < 1 
for 2 < |jc|. 


Denote by v the function (1 — f)u\ then since fu = 0 for \x\ < 1, by theorem 4, 
i(fu) = 0. Therefore 




This shows that it suffices to look at functions u whose support is contained in, say, 
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Let U] and “2 be two Cq° functions whose values, and the values of all their 

.'denv'atives up to order N, are equal at x = 0. It follows from lemma 6 that then 

- ^( i q-)- = -^( W 2 )rIn-other wordsr^(w) only depends on the value of u and its derivatives 

up to order N six = 0. It follows that € must be of the form 

£(„>= —... 

The conclusion of theorem 5 is just a restatement of this relation. □ 

Theorem 7. Eveiy distribution i of compact support can be written in the form 

£ = J2 P a g a , 

\a\<L 

where the g a are continuous functions, and L some whole numbei: 


Proof. Let ^ be a Cq° function that equals 1 on an open set containing the support 
of €. It follows from theorem 4 that for all w € Cg°, £(u) = i{hu). 

Since every function of the form of hu is zero outside the support of h, it fol¬ 
lows from theorem 1 that there is a positive constant and an integer N such that 
\W^)\ S const. \hu\^. Clearly, \liu\^ < const. |m|^, the constant depending only 
on h. Combining these relations, we deduce that 


\i{u)\ < const. |«| 况 

for all u, with a constant and N independent of u. 
We introduce now the norms 


NIIm 


f \D a u\ 2 dx 


1/2 


M<M 


( 10 ) 


( 11 ) 


and denote by Hm the completion of Cq° under this norm. Since this norm comes 
from a scalar product that we denote as ( ， ) 似 ， is a Hilbert space. It follows from 
the Riesz-Frechet theorem that every continuous linear functional in Hm is of the 
form 


(u,g)M, g € H m . (12) 

Hm is the Sobolev space W Mi2 introduced in chapter 5. 

According to an important inequality due to Sobolev (see Adams’s book cited in 
chapter 5)，for some constant depending only on the size of the support of 
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to a C N function u. This mapping of Hm into C /V is one-to-one, so that this is an 
embedding of Hm in C N . 

Combining (10) and (13)，we see that \t(u)\ < const. ||«||^/, which implies that 
t(u) is a bounded linear function in Hm. Therefore it can be represented in the 
form (12): 

i(u) = (u y g)M =： ^2 (D 〜， D<X S)* 

\a\<M 

The right side can be rewritten as 

m = ^(-D |a| («, D 2a g ) = («, ^(-D |fl iD 2 ^), 
where D 2a g denotes a distribution derivative of g. Thus 

£ = f (一 l) |fl| D 2a : (14) 

Since g belongs to Hm ，gisaC N function; thus (14) is the representation of € stated 
in theorem 7, with L = 2M 一 N. □ 


The next result shows that just like functions, distributions are determined up to a 
constant by their first derivatives. More precisely, we have 

Theorem 8. Let G be an open, connected subset ofW 1 , i a distribution in R n such 
that all first partial derivatives Djl of l are zero in G. Then l = const, in G. 








For Cq° functions, convolution and differentiation commute. Therefore，according 
to exercise 1, they commute also for distributions: 

Dj(b^m) = (Djb) * m 

for any Cq° function b and-any distribution m. 

Exercise 8. Show that 

Dj(b^m) = b * Djni 

for every Cq° function Z? and every distribution m. 

Lemma 10. Let b denote a Cq° function whose support lies in a ball of radius r 
around the origin. Let m be any distribution; then eveiy point x in the support of 
b * m has distance < r from some point of the support ofm. 

Exercise 9. Prove lemma 10. 

Define as in lemma 9. It follows from exercise 8 that 

= Dj(b k *€)=&* DjL 

The support of 〜isxonfined to a ball of radius 1 / 上 around the origin, and according 
to the assumption in theorem 8, the support of Djt is contained in the complement 
of G. Therefore it follows from lemma 10 that Dji^ is zero in Gk, the set of all 
points of G whose distance from the boundary of G is greater than 1/k. 

Let u be any Cq° function whose support S is contained in G. It is not hard to 
show that since G is connected, there exists a positive d such that any two points of 
S can be connected by a polygonal path P in G whose distance from the boundary 
of G is > d. For k larger than 1 /d, such a path P lies in G 人 •，and therefore all partial 
derivatives D y of i^are.zero along J?. It 4 is a. constant along. P. the 

same constant for all points of S. Call this constant q; then by (2) 

— J dx ^ J udx. 

According to lemma 9, the left side tends to i{u) as oo; therefore the right side 
tends to a limit. For f u dx ^0 it follows that q tends to a limit c; it is easy to show 
that c is independent of w, as asserted in (15). □ 

Theorem 11. Suppose that g is a continuous function whose derivative in the sense 
of distributions Djg is a continuous function. Then Djg is the derivative of g not 
only in the sense of distributions but in the classical sense. 

Proof, According to lemma 9, g is the limit in the sense of distributions of the 
sequence of Cq° functions 8- Since differentiation commutes with convo¬ 

lution, Djgi^ = hi： *Djg. Since both g and Djg are continuous functions, according 













to chapter 1 1， section 11.1, gk g and Djgjc Djg uniformly on compact sub¬ 
sets. According to calculus 


gk(b) - gkW 

Letting k tend to oo, we obtain 

gib)-g(a) 



from this integral relation it follows that Djg is the derivative of g in the classical 
sense. □ 


Definition. A distribution l is called positive if t(u) is nonnegative for every non¬ 
negative Cq° function. 

Examples of positive distributions abound: 

Example 11. I a nonnegative continuous function. 

Example 12. t = 8(x - a). 

Example 13. t{u) = f u dm, m a measure. 

Lemma 12. Let l be a positive distribution. To each compact set K in R n there is a 
constant c such that \i(u)\ < c\u\ mjiX for every function it supported in K. 

Proof. Denote by /? a nonnegative Cq° function that is equal to 1 on AT. Since u 
is supported in K, for every x, 

w(a-) < \u\ mSLK p(x). 

Since € is a positive distribution, ^(«) < |w| max £(/?). Similarly -u(x) < \u\ m ^p(x), 
so —£(«) < |w| max £(p). This gives the inequality in lemma 12, with c = l{p). □ 

Using inequality \i{u)\ < c|«| max , we can extend，by continuity, the linear func¬ 
tional l to all continuous functions u with compact support; the extended functional 
remains positive. 

According to the Riesz-Kakutani representation theorem (see Appendix A) a pos¬ 
itive linear functional on the space Co of continuous functions with compact support 
can be represented as an integral with respect to a measure. This proves 


Theorem 13. Every positive distribution is a measure: 






B.4 APPLICATIONS TO PARTIAL DIFFERENTIAL EQUATIONS 

In chapter 9 we constructed the regular part g of Green’s function for smoothly 
bounded domains in the plane. Given any point q in D, g — g{p\ q) is a harmonic 
function of p whose value on the boundary of D equals log|p — q\. The difference 
logtp ■ — 々I — g('pvq) is Green’s function G(p; q). We want to show that Green’s 
•Ifunction satisfies，in the sense of distributions, the equation 

AG = 27t8(p — q), 

where A is the Laplace operator A = x and y the Cartesian coordinates 

of p. Since the regular part of Green’s function satisfies △茗 = 0,.it suffices to show 
that k 

A log y/x 2 +y 2 = 2ttS(x, j )； (17) 

here we have chosen q = 0. 

Proof. According to the definition of derivatives of distributions, (17) means that 
for all Cq° function w, 

J log \p\Audxdy = 2jtm(0). (17’） 

To see this, we write the integral on the left as the limit as e 0 of integrals over 
the exterior of circular disks of radius s around the origin: 

f \og\p\Audx dy. (18) 

. J\p\>£ 

Integration by parts changes (18) into 

f (Alog\p\)udxdy+ f log\p\d n uds- [ (d n log \p\)uds. (180 
Ap\>s J\p\^e J\p\=e 

Here d n is the outward normal derivative on the boundary of the domain \p\ > e; 
that is, d n = —d/dr on the circle \p\ = e, where r = \p\. 

Since log \p\ is a regular harmonic function for p ^ 0, A log \p\ = 0 for \p\ > e, 
so the double integral in (18)’ is zero. The first line integral in (18)’ is less in absolute 
value than const. 27te\ loge|, where the constant is an upper bound for |9 rt w|.So ase 
tends to zero, this term tends to zero. In the second line integral 

d 1 

3/i log \p\ = 一 7 log/* = —一 • 
dr r 

On the circle \p\ = e this equals —\/e. The value of u at every point of the circle 
|p| = s equals w(0) 4 - 0(e). It follows that as e tends to zero, the last term in (18)’ 
tends to 2nu ( 0 ); this proves formula (17). □ 






Theorem 14. Let i be t 
Laplace’s equation Ai = 

Proof. The proof is b; 

Lemma 15. Let f dene 
f(x) = g(\x\), andsupf 



Then there exists a sphei 
satisfies 


Proof. Write A, to be 
nater we can write (20) 


where 1 denotes differen 
equation that can be wri 


Integrating this equatior 


We claim that the right 
rewrite it as 


According to (19) this i 
ball of radius R around 
Divide (21) by r ,I_1 i 
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The difference bk t y—b nit y is spherically symmetric about the point y, and its support 
is contained in a ball of radius R around the point y, where R = max(l/fc, l/m). 
It follows from (23’）that each of the functions b^y — b m ， y satisfies conditions (19). 
Therefore according to lemma 15, it is the Laplacean of a C 00 function /z, spherically 
symmetric about the point y, whose support is contained in the ball of radius R 
around y, satisfying Ah = bk、 y — b nit y. So we can rewrite (24) as 

4()0 —“00 = (A/vO. 

According to theorem 2 we can rewrite the right side as (h ， bJi\ which is zero since 
AZ = 0. So we conclude that at all points y in D, ik(y) = (y) when k and m 

are greater than i/d, d the distance of y to the boundary of D. Therefore in every 
compact subset of D, tk does not depend on k for k large enough. According to 
lemma 9, the C°° functions converge in the sense of convergence of distributions 
to & This proves that t is C°° in D, as claimed in theorem 14. □ 

Theorem 14 is true for solutions of any elliptic partial differential equation with 
C°° coefficients. 

Solutions of hyperbolic partial differential equations are quite different in regard 
of differentiability. Again we take the simplest case, the wave equation in one space 
dimension: 

utt - Hxx = 0 . ( 25 ) 

Every function of form 

u(x, t) = f(x +1) ^ g(x - t), (26) 

where / and g are twice differentiable functions, is a solution of (25). Conversely, all 
twice differentiable solutions of (25) can be written in this form. To see this, rewrite 
(25) as 

(D t + D x )(u e - u x ) = 0, {D t - D x )(u t + u x ) = 0. 

From these equations we conclude, respectively, that 

u t - ii x = a(x - t) y u t + u x == b{x + 0, 

- ---where a and-fe are-once-difFerentiable-functions^-Adding-these-and-integratmg-once. 

yields the representation (26). . . ..〜 ― 

What about solutions that are distributions? We claim that for any pair of distri¬ 
butions l and m in a single variable, 

u = i(x + 0 + fn(x - t) (260 

is a solution of the wave equation in the sense of distributions. To verify this' we note 
that it follows from example 6 in section B.2 that m(x — t) can be defined as 
























Proof. The integral in formula (30) can be differentiated with respect to f , giving 

D a Fu = Faixfu). (31.) 

For u in 5, x a u tends to zero as \x\ oo faster than any power of |x|; therefore x a u 
belongs to O. It follows that for any multi-index a, D a Fu is a continuous function. 
… This - proves "that Fw is C°°. To show that Fu is in class S, we deduce from (30) by 
: integration by parts on the right that for any multi-index p, 

^¥u^¥((iD) p ii). . (32) 

Combining (31) and (32), we get ^D a Fu = ./^FCD^w). This shows that 
^D a Fu is a linear combination of the Fourier transforms of functions of the form 
D u. Since for u of class S these belong^to L 1 , it follows from exercise .16 that 
their Fourier transform is bounded in R n . This shows that Fu belongs to class S. 

□ 

Exercise 17. Show that the mapping w — > F« is a continuous mapping of S into S. 

The following theorem summarizes the relation of the Fourier transform to the 
usual operations in R n . 

Theorem 17. 


(0 F transmutes, translation in M 71 into multiplication by e ia . 与 . That is, define T a 
by (T a u)(x) = u[x — a). Then 


¥T a u = e ia ^Fu and T a Fu = Fe~ ia ， x u. 
(ii) The infinitesimal version of(i) is 

FiDjU = 与 jFu, DjFu = FixjU. 


(Hi) F commutes with rotation around the origin, and with reflection R defined by 
(Ru)(x) = w (- A’). 

(iv) Let A be an invertible map of R 71 into K 11 . Denote u(Ax) as ua(x). Then 


FiM 


|det>4| 


(Pu)b, 


where B = (A— 1 /. 


Exercise 18. Prove theorem 17. 

Exercise 19. Prove that the convolution of two functions u and v in 5 is in S, and 
that 


F(« * u) = (2 丌 ) " /2 (F«)(Fu). 
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The kernel of the Fourier transform is a symmetric function of x and Therefore 
F is its own transpose F’. So we can use theorem 2, the extension theorem, to define 
the Fourier transform of a tempered distribution 

(Fu，《) = (u ， F£) (33) 

for all v in S. 

Theorem 17 / . Theorem 19 is valid for the Fourier transform of tempered distribu¬ 
tions. 

Exercise 20. Prove theorem 17' 

For distributions i with compact support, we can use definition (30) directly to 
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Suppose now that AMs > 0; then there is an a, |a| = N t for which c a 7^ 0. 
Combining (37) with lemma 19, we conclude that c a = 0, a contradiction. Therefore 
we conclude from (36) that d = c 0 8. So all that remains to prove theorem 18 is to 
determine the constant c 0 - This is easily done; in the definition (33) of F£, 

(Fv，£) = (i; ， W )， 

set = 1 and u = e - xl I 1 ， F£ = d = c 0 S f and according to exercise 5, Fu 二 
e-^ 2 / 2 : 

(e-^ 2 ,l) = (e-^ 2 ,c 0 S). 

The left side equals f R „ e~^' /2 d$ = (2jr) n/2 -, the right side equals c 0 , the assertion 
of theorem 18. 口 


Theorem 20. 

(i) ¥isan invertible mapping ofS into S, and its inverse is given by 

u{x) = J u^)e~ ix -^. (39) 

(ii) F is an invertible mapping of S’ Into S', and F - 1 = FR t where R is defined 
in part (Hi) of theorem 17. 

Proof. - 

⑴ According to part ⑴ of theorem 17, Fe 一邮 =r fl F. This implies, by theo¬ 
rem 17’， that for every distribution £, 

Fe - 叫 = T a FL 

Take € = 1; we get, using theorem 18, that 

Fe - ia , = (2n) n/2 8(x - a). 

Setting this into formula (33), we get for all u in 5 and i = e^ ia '^ that 
( 5 , e^) = (27t) n/2 (v t 8(x-a)). 
Xh^rfurdelsJSIf)^ 

(ii) In (39) replace 专 by —夸 as variable of integration; we get that F~ ! = FR. We 
can express this as FRF = I, the identity. So for any function v in 5 and any 
tempered distribution i f 

(y, t) = (FRFv, i) = (RFv, Ft) = (Fv t RK) = (u ， FRFO. 

This proves that FRF£ = i, that is, that RFisa right inverse for F on S\ and 
FR a left inverse. Since F and R commute, this proves part (ii). □ 
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Theorem 21 (ParsevaFs Formula). The Fourier irdnsfoim u of evety L?" function 
u lies inL 7 - and 陶 厶 2 = \\u\\ L i, 一 

…厂 We'wanrtirstixrprOTe-thisibr irhrSrlkke-the—complex' conjugate of for- …… 
mula (30) defining the Fourier transform: 

. ._ ^j^J/ue^j^dx. _. _ ： _ .… 

The right side can be written as RFIT, so 

Fw = RF«. (40) 

Since the Fourier transform-is-its own transpose 5 - * 

(Fw,u) = (w,Ft;). 

Now set v = Fw; using (40), we get 

(F«,FV) = («, RF^) = (u, FEFu) = (u,u), 

since we have shown in the course of proving theorem 20 that FRF = I. This proves 
the L 2 isometry of the Fourier transform acting of the class S. We get the result for 
any u in L 2 by approximating u with a sequence of functions of class S. □ 

Exercise 23. Prove that u, defined aslhe L? limit satisfies (33). 

In words, theorem 21 says that the Fourier transform is a unitary operator that 
maps L 2 (E h ) onto itself. 

The next result is about distributions in® 1 . 

Theorem 22. Define for any a in K 1 

. Pa -^S^x ^am), ; ^(41) 

m 

summed over all integers m. 

(i) p a is a tempered distribution. 

⑼ The Fourier transform of p a is (V27t/\a\)pjj t b = In/a. 

Pyoof. Part (i) is obvious. To prove part (ii), we observe that p a is periodic with 
period a: 

TaPct 一 Pa =0. 

We take the Fourier transform of this relation; using the relation of Fourier transform 
to translation (see theorem 170, we get 

(〆 -l)p a =0. 


(42) 
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The function e- lbx 、 b = In/ci., is = 1 at all points am, m an integer. Therefore it 
follows from the definition (41) of p a that 

e l ^ X Pa = Pci- 

Take the Fourier transform of this relation; using again theorem 17, we get 

TbPa = Pa, (43) 

that is, that p a is periodic with period b. 

Let m(|) be any function of class S that is zero at all integer multiples of 厶： 

u(nb) = 0, n in Z. 

Such a function is divisible by the function e ia ^ — 1: 

u^)=e ia ^ l v^), (44) 

v in S. Therefore for such a u 

(u,Pa) = W 神 二 U) = ([ { e ia ^-1)p a ) =0, (45) 

where in the last step we have used (42). 

It follows from (45) that if u{nb) = 0 for |/i| > N, then the value of («, p cl ) 
depends only on the values of u(nb), | ； 2 j < N. Since this dependence is linear, 

(«, Pa) = c fl u(nb). (46) 

\n\<N 

Since, by (43), p a is ^-periodic, it follows that all the c n are equal; we denote their 
common value by c: 

(m ，戶 ) = c I] « ⑽)， (47) 

where c = c(a) depends on a. 

Any u in 5 can be approximated in .the topology of S by a sequence of functions 
un such that u^(nb) = 0 for |n| > N. Setting w = w/y in (47) and letting N oo 
we conclude that (47) holds for all u in S. We can restate this result simply in the 
notation of (41) as 

- m I 2?r 

--- Pa^.. --.&=_==：. ....(48X 

a 

Take the Fourier transform of both sides. Since P a is an even function, the Fourier 
transform of p a is P a , and so we get 

A = c ( a )pb- (48 ’） 

Interchanging a and b in (48), pb = c(b)P a . From this and (48)’ we deduce that 
c(a)c(b) = 1 for ab = In. For a = = V2tv we get c(^/2n) 2 = 1, so c(V2tt) is 

either 1 or — 1. 






The definition of the Fourier traiisform of a tempered distribution such as P a is 
that for all functions u of class S, (5, P a ) = (m, p a ). Using the definition (41) of P a 
and formula (47) for p a , we get, with 办 = 2jc/a, 

u(am) = c ⑷ ^2 ^(bn). (50) 

m n 

Given u and any number define w r by u r {x) /20^)rThe FOurigf tfSMf6ftif(5f 一 ' 

u r is — . . . • — • • -— 

«r(^) = J u{rx)e^ x dx = J = .( 备 )- 

Setting u r into (49) in place of u gives " 

rj (~m)=c(a)J2 u(rbn). (49 ’） 

« 7 ' m r n 

We can use (49) with a replaced by a/r, and b replaced by rb, to express the left 
side of (49’）as 


c(a/r) 

— "~FT 

Since this equals the right side of (49’)，we deduce that {l/\r\)c{a/r) = c(a). Setting 
r = a/Jlit weget (v^7r/|a|)c(v^r) = c(a). Since we have shown that c(v^)= 
士 1, we deduce that 




We claim that the correct sign is the positive one. For take any positive, even func¬ 
tion v of class S, and define w as u * u. Then u is positive, and since the Fourier 
transform of the even function v is real, u — y/lnxP- too is positive. So it follows 
from formula (49) that c{a) is positive as well. This completes the proof of theo¬ 
rem 22. □ 


Setting c = ^/2n/\a\ into (49) yields 


Poisson Summation Formula. For eveiy function u of class S and for all real a, 

P ㈣ =#?“(&)• (50) 


Poisson’s formula is valid for a much wider class of functions than S. 
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NOTE. In chapter 30, section 30.6. we have derived Poisson’s formula as a spec 
case of the trace formula applied to the convolution operator. We used there a diff 
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Exercise 24. 

(a) Show that the dual of a lattice is a lattice. 

(b) Show that = L. 

(c) Show that for every lattice L and every invertible linear map A. otW. 冰 

r A T\! ^ Dr/ ...I_ n / ^ -1 x/ r 



Let L be a latticerdefiote by p L the tempered'distrfbutloF. . ——— 

PL= S(x - (52) 

• _ am L . 

Theorem 24. The Fourier transform of pi is 

PL = c (L)P27rU ， (53) 

w ’ iere C (L) = (27r) /7 / 2 /|det A|, and A is a matrix that appears in the representation 
ofL of form (51). 

Exercise 25. Prove theorem 24 by imitating the steps that went into the proof of 
theorem 22. 乂 

Exercise 26. Formulate Poisson’s summation formula for lattices in R n . 

B.6 APPLICATIONS OF THE FOURIER TRANSFORM 

The Fourier transform is used in a large part of mathematics to solve, or at least 
reformulate, problems. Here are a few examples. 

Liou ville 5 s Theorem. Let f (z) be an analytic function defined in the whole complex 
plane that is of polynomial growth: 

l/(^)i< const. (1 + \z\) M . 

Then f is a polynomial of degree < M. 

Proof. An analytic function satisfies the Cauchy-Riemann equation 

%/ = Udjr-i dy)f = 0. (54) 

A function / of polynomial growth is a tempered distribution; so are its derivatives. 
The Fourier transform of (54) is 


(巧 + 77 )/(^, ?j) = 0 . 
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Exercise 28. 

(a) Show that for any there is a constant such tha t \b n \ < co nst. 

(b) Show that the partial sums of the Fourier series of w, u k = b n e in0 , 
converge to u in the C N topology. 

(c) Let ^ be a distribution. Its Fourier coefficients a l} are defined by 

. . a n = (e^ ine , i)/2n ： .—. 

Show that for any C°° function w, 

•- - -(";-名)-= > : - - — . . 

(d) Let {a n } be a sequence of complex numbers that satisfy 

Wn\ < const. \n\ N 

for some 从 Show that the are the Fourier coefficients of some distribution 
onS 1 . 
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ZORN，S LEMMA 


Zorn’s lemma is a theorem in the Zermelo-Fraenkel system of set theory. It is equiv¬ 
alent logically with the axiom of choice. Thus its use introduces a highly noncon¬ 
structive step; therein lies its power. 

Zorn’s lemma deals with partially ordered sets, nonempty sets where an order 
relation a < bis defined for some pairs of elements in the set, which satisfies 

(i) transitivity: if a <b and b <c, then a <c. 

(ii) reflexivity: a <afox all a in the set, 

A subset of a partially ordered set is called totally ordered if for every pair j ，y of 
elements in it, either ;c <y or y <x. 

An element u of a partially ordered set is said to be an upper bound of a subset if 
x < u for every element x in the subset. 
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Principle of uniform boundedness, 103 


Scalar product, 52 
nondegenerate, 59 
Scattering theory, 273,477 
inverse problem, 273 
operator, 491 
potential, 490-491 
reflection coefficient, 273 
Schrodinger operator, 402,414,440,492 
Schwarz 
inequality, 52 
reflection principle, 500 
Selberg trace formula, 349 
Self-adjoint operators, 377-378, 396 
essentially self-adjoint, 398 
perturbation，406 
Semigroup, 416 
approximation, 427 
generation, 424, 440 
spectral theory, 434 
strongly continuous, 418 
transpose, 438 
uniformly continuous, 417 
weakly continuous, 424 
Sesquilinear, 52 
Singular value, 330 
Sobolev 

inequality, 42,550 
norm, 40,298 
space, 40 

Spectral mapping theorem, 200 
Spectral multiplicity, 366-370 
Spectral radius, 195,277 

_CkifandIsionnula,Jj95 ： _ 

Spectral representation 
symmetric operators, 36t370 
Spectral resolution 
compact maps, 238-242 
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Spectrum, 194, 238, 241，253, 256 
absolutely continuous, 364 
compact operator, 238,316 
point spectrum, 238-242,486 
symmetric operator, 356-357 
unbounded operator, 419 
Stochastic integral operator，256 
Stone-Weierstrass theorem. 126 



analyticity, 111 
convergence, 99 

convergence of sequences of maps, )65 
solution, ] 15 

'topology of Unear maps, 165 
Support function, 83 

Tauberian theorem, 220 
Three lines theorem, 178 
Titchmarsh convolution theorem， 280, 523-525 
Toeplitz operator, 306-312 
matrix, 307 
symbol, 306 

Trace, 267,305, 329 -330, 333 - 334 



formula, 267,329,334-341 

norm, 330,479 _ 

Translation representation, 448-450 
Transpose, 163 



Wave equation, 187,447,454 
automorphic, 505 
hyperbolic, 505 
incoming data, 454 
Wave operator, 480-489 
invariance, 490 
Weak 



compactness, 121 
convergence, 99 

convergence of sequences of maps. 
165 - 

** sequential closure, 118 
sequential compactness, 104 
solution, 115 
topology, 118 

topology of linear map, 165 
Weak* 

compactness’ 120 
convergence, 105 
sequential compactness, 106 
topology, 118 

Wiener-Hopf operators, 312 
Winding number, 307 

Zaremba’s inequality, 65 





